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Enterococcus faecalis polynucleotides and polypeptides 

Field of the Invention 

The present invention relates to novel Enterococcus faecalis genes (E. faecalis) 
5 nucleic acids and polypeptides. Also provided are vectors, host cells and recombinant 
methods for producing the same. Further provided are diagnostic methods for 
detecting Enterococcus faecalis using probes, primers, and antibodies to the E. faecalis 
nucleic acids and polypeptides of the present invention. The invention further relates 
to screening methods for identifying agonists and antagonists of E. faecalis 
10 polypeptide activity and to vaccines using E. faecalis nucleic acids and polypeptides. 

Background of the Invention 

Enterococci have been recognized as being pathogenic for humans since the 
turn of the century when they were first described by Thiercelin in 1988 as 

15 microscopic organisms. The genus Enterococcus includes the species Enterococcus 
faecalis or E. faecalis which is the most common pathogen in the group, accounting for 
80 - 90 percent of all enterococcal infections. See Lewis et al. (1990) Eur J. Clin 
Microbiol Infect Dis.9:l 11-117. 

The incidence of enterococcal infections has increased in recent years and 

20 enterococci are now the second most frequently reported nosocomial pathogens. 

Enterococcal infection is of particular concern because of its resistance to antibiotics. 
Recent attention has focused on enterococci not only because of their increasing role in 
nosocomial infections, but also because of their remarkable and increasing resistance to 
antimicrobial agents. These factors are mutually reinforcing since resistance allows 

25 enterococci to survive in an environment in which antimicrobial agents are heavily 
used; the hospital setting provides the antibiotics which eliminate or suppress 
susceptible bacteria, thereby providing a selective advantage for resistant organisms, 
and the hospital also provides the potential for dissemination of resistant enterococci 
via the usual routes of hand and environmental contamination. 
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Antimicrobial resistance can be divided into two general types, inherent or 
intrinsic property and that which is acquired. The genes for intrinsic resistance, like 
other species characteristics, appear to reside on the chromosome. Acquired 
resistance results from either a mutation in the existing DNA or acquisition of new 
5 DNA. The various inherent traits expressed by enterococci include resistance to 
semisynthetic penicillinase-resistant penicillins, cephalosporins, low levels of 
aminoglycosides, and low levels of clindamycin. Examples of acquired resistance 
include resistance to chloramphenicol, erythromycin, high levels of clindamycin, 
tetracycline, high levels of aminoglycosides, penicillin by means of penicillinase, 

10 fluoroquinolones, and vancomycin. Resistance to high levels of penicillin without 
penicillinase and resistance to fluoroquinolones are not known to be plasmid or 
transposon mediated and presumably are due to mutation(s). 

Although the main reservoir for enterococci in humans is the gastrointestinal 
tract, the bacteria can also reside in the gallbladder, urethra and vagina. 

1 5 E. faecalis has emerged as an important pathogen in endocarditis, bacteremia, 

urinary tract infections (UTls), intraabdominal infections, soft tissue infections, and 
neonatal sepsis. See Lewis et al. (1990) supra.. In the 1970s and 1980s enterococci 
became firmly established as major nosocomial pathogens. They are now the fourth 
leading cause of hospital-acquired infection and the third leading cause of bacteremia in 

20 the United States. Fatality ratios for enterococcal bactermia range from 1 2% to 68%, 
with death due to enterococcal sepsis in 4 to 50% of these cases. See T.G. Emori 
(1993) Clin. Microbiol. Rev. 6:428-442. 

The ability of enterococci to colonize the gastrointestinal tract, plus the many 
intrinsic and acquired resistance traits, means that these organisms, which usually 

25 seem to have relatively low intrinsic virulence, are given an excellent opportunity to 
become secondary invaders. Since nosocomial isolates of enterococci have displayed 
resistance to essentially every useful antimicrobial agent, it will likely become 
increasingly difficult to successfully treat and control enterococcal infections. 
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Particularly when the various resistance genes come together in a single strain, an 
event almost certain to occur at some time in the future. 

The etiology of diseases mediated or exacerbated by Enterococcus faecalis, 
involves the programmed expression of E. faecalis genes, and that characterizing these 
5 genes and their patterns of expression would dramatically add to our understanding of 
the organism and its host interactions. Knowledge of the E. faecalis gene and genomic 
organization would improve our understanding of disease etiology and lead to 
improved and new ways of preventing, treating and diagnosing diseases. Thus, there 
is a need to characterize the genome of E. faecalis and for polynucleotides of this 
10 organism. 

Summary of the Invention 

The present invention provides for isolated E. faecalis polynucleotides and 
polypeptides shown in Table 1 and SEQ ID NO: 1 through SEQ ID NO:496 

15 (polynucleotide sequences having odd SEQ ID NOs and polypeptide sequences 

having even SEQ ID NOs). One aspect of the invention provides isolated nucleic acid 
molecules comprising polynucleotides having a nucleotide sequence selected from the 
group consisting of: (a) a nucleotide sequence shown in Table 1 ; (b) a nucleotide 
sequence encoding any of the amino acid sequences of the polypeptides shown in 

20 Table 1 ; and (c) a nucleotide sequence complementary to any of the nucleotide 

sequences in (a) or (b). The invention further provides for fragments of the nucleic 
acid molecules of (a), (b) & (c) above. 

Further embodiments of the invention include isolated nucleic acid molecules 
that comprise a polynucleotide having a nucleotide sequence at least 90% identical, 

25 and more preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the 
nucleotide sequences in (a), (b) or (c) above, or a polynucleotide which hybridizes 
under stringent hybridization conditions to a polynucleotide in (a), (b) or (c) above. 
Additional nucleic acid embodiments of the invention relate to isolated nucleic acid 
molecules comprising polynucleotides which encode the amino acid sequences of 
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epitope-bearing portions of a E,faecalis polypeptide having an amino acid sequence in 
(a) above. 

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing 
5 the recombinant vectors, as well as to methods of making such vectors and host cells. 
The present invention further relates to the use of these vectors in the production of 
E. faecalis polypeptides or peptides by recombinant techniques. 

The invention further provides isolated E. faecalis polypeptides having an 
amino acid sequence selected from the group consisting of an amino acid sequence of 
10 any of the polypeptides described in Table 1 or fragments thereof. 

The polypeptides of the present invention also include polypeptides having 
an amino acid sequence with at least 70% similarity, and more preferably at least 75%, 
80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in Table 
1 , as well as polypeptides having an amino acid sequence at least 70% identical, more 
15 preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 95%, 
96%, 97%, 98%, or 99% identical to those above; as well as isolated nucleic acid 
molecules encoding such polypeptides. 

The present invention further provides a single or multi-component vaccine 
comprising one or more of the E. faecalis polynucleotides or polypeptides described 
20 in Table 1 , or fragments thereof, together with a pharmaceutically acceptable diluent, 
carrier, or excipient, wherein the E. faecalis polypeptide(s) are present in an amount 
effective to elicit an immune response to members of the Enterococcus genus, or at 
least E. faecalis , in an animal. The E. faecalis polypeptides of the present invention 
may further be combined with one or more immunogens of one or more other 
25 Enterococcal or non-Enterococcal organisms to produce a multi-component vaccine 
intended to elicit an immunological response against members of the Enterococcus 
genus and, optionally, one or more non-Enterococcal organisms. 

The vaccines of the present invention can be administered in a DNA form, e.g., 
"naked" DNA, wherein the DNA encodes one or more Enterococcal polypeptides 
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and, optionally, one or more polypeptides of a non-Enterococcal organism. The DNA 
encoding one or more polypeptides may be constructed such that these polypeptides 
are expressed as fusion proteins. 

The vaccines of the present invention may also be administered as a 
5 component of a genetically engineered organism or host cell. Thus, a genetically 

engineered organism or host cell which expresses one or more E.faecalis polypeptides 
may be administered to an animal. For example, such a genetically engineered 
organism or host cell may contain one or more E.faecalis polypeptides of the present 
invention intracellularly, on its cell surface, or in its periplasmic space. Further, such 
1 0 a genetically engineered organism or host cell may secrete one or more E. faecalis 

polypeptides. The vaccines of the present invention may also be co-administered to 
an animal with an immune system modulator (e.g., CD86 and GM-CSF). 

The invention also provides a method of inducing an immunological response 
in an animal to one or more members of the Enterococcus genus, preferably one or 
15 more isolates of the E.faecalis species, comprising administering to the animal a 
vaccine as described above. 

The invention further provides a method of inducing a protective immune 
response in an animal, sufficient to prevent, attenuate, or control an infection by 
members of the Enterococcus genus, preferably at least E.faecalis species, 
20 comprising administering to the animal a composition comprising one or more of the 
polynucleotides or polypeptides described in Table 1, or fragments thereof. Further, 
these polypeptides, or fragments thereof, may be conjugated to another immunogen 
and/or administered in admixture with an adjuvant. 

The invention further relates to antibodies elicited in an animal by the 
25 administration of one or more E. faecalis polypeptides of the present invention and to 
methods for producing such antibodies and fragments thereof. The invention further 
relates to recombinant antibodies and fragments thereof and to methods for producing 
such antibodies and fragments thereof. 

The invention also provides diagnostic methods for detecting the expression of 
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the polynucleotides of Table 1 by members of the Enterococcus genus in an animal. 
One such method involves assaying for the expression of a polynucleotide encoding 
E.faecalis polypeptides in a sample from an animal. This expression may be assayed 
either directly {e.g., by assaying polypeptide levels using antibodies elicited in 

5 response to amino acid sequences described in Table 1) or indirectly (e.g., by assaying 
for antibodies having specificity for amino acid sequences described in Table 1). The 
expression of polynucleotides can also be assayed by detecting the nucleic acids of 
Table 1 . An example of such a method involves the use of the polymerase chain 
reaction (PCR) to amplify and detect Enterococcus nucleic acid sequences. 

10 The present invention also relates to nucleic acid probes having all or part of a 

nucleotide sequence described in Table 1 (odd SEQ ID NOs) which are capable of 
hybridizing under stringent conditions to Enterococcus nucleic acids. The invention 
further relates to a method of detecting one or more Enterococcus nucleic acids in a 
biological sample obtained from an animal, said one or more nucleic acids encoding 

15 Enterococcus polypeptides, comprising: (a) contacting the sample with one or more 
of the above-described nucleic acid probes, under conditions such that hybridization 
occurs, and (b) detecting hybridization of said one or more probes to the Enterococcus 
nucleic acid present in the biological sample. 

Other uses of the polypeptides of the present invention include: inter alia, to 

20 detect E. feecalis in immunoassays, as epitope tags, as molecular weight markers on 
SDS-PAGE gels, as molecular weight markers for molecular sieve gel filtration 
columns, to generate antibodies that specificaly bind E.faecalis polypeotides of the 
present invention for the detection E.faecalis in immunoassays, to generate an 
immune response against E. faecalis and other Enterococcus species, and as vaccines 

25 against E. faecalis, other Enterococcus species and other bacteria genuses. 

Isolated nucleic acid molecules of the present invention, particularly DNA 
molecules, are useful as probes for gene mapping and for identifying E.faecalis in a 
biological samples, for instance, by Southern and Northern blot analysis. 
Polynucleotides of the present invention are also useful in detecting E.faecalis by 



WO 98/50554 



-7- 



PCT/US98/08959 



PCR using primers for a particular E. faecalis polynucleotide. Isolated 
polynucleotides of the present invention are also useful in making the polypeptides of 
the present invention. 

5 Detailed Description 

The present invention relates to recombinant E. faecalis nucleic acids and 
fragments thereof. The present invention further relates to recombinant E. faecalis 
polypeptides and fragments thereof. The invention also relates to methods for using 
these polypeptides to produce immunological responses and to confer immunological 

10 protection to disease caused by members of the genus Enterococcus, at least isolates 
of the E. faecalis genus. The invention further relates to nucleic acid sequences which 
encode antigenic E. faecalis polypeptides and to methods for detecting E. faecalis 
nucleic acids and polypeptides in biological samples. The invention also relates to 
antibodies specific for the polypeptides and peptides of the present invention and 

15 methods for detecting such antibodies produced in a host animal. 

Definitions 

The following definitions are provided to clarify the subject matter which the 
inventors consider to be the present invention. 
20 As used herein, the phrase "pathogenic agent" means an agent which causes a 

disease state or affliction in an animal. Included within this definition, for examples, 
are bacteria, protozoans, fungi, viruses and metazoan parasites which either produce a 
disease state or render an animal infected with such an organism susceptible to a 
disease state (e.g., a secondary infection). Further included are species and strains of 
25 the genus Enterococcus which produce disease states in animals. 

As used herein, the term "organism" means any living biological system, 
including viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term "Enterococcus" means any species or strain of 
bacteria which is members of the genus Enterococcus. Such species and strains are 



WO 98/50554 



-8- 



PCT/US98/08959 



known to those of skill in the art, and include those that are pathogenic and those that 
are not. 

As used herein, the phrase "one or more E. faecalis polypeptides of the 
present invention" means polypeptides comprising the amino acid sequence of one or 

5 more of the E. faecalis polypeptides described in Table 1 (even SEQ ID NOs). These 
polypeptides may be expressed as fusion proteins wherein the E. faecalis 
polypeptides of the present invention are linked to additional amino acid sequences 
which may be of Enterococcal or non-Enterococcal origin. This phrase further 
includes polypeptide comprising fragments of the E, faecalis polypeptides of the 

10 present invention. Additional definitions are provided throughout the specification. 

Explanation of Table 1 

Table 1, below, provides information describing genes which encode 
polypeptides of E. faecalis. The table lists the gene identifier which consists of the 

15 letters EF, which denote E. faecalis, followed immediately by a three digit numeric 
code, which arbitrarily number the E. faecalis genes of the present invention. A 
number from 1 through 4 follows the three digit number. A number 1 represents the 
full length open reading frame of the gene specified by the preceeding three digit 
number. A number 2 represents the full length polypeptide encoded by the gene 

20 specified the preceeding three digit number. A number 3 represents a polynucleotide 
fragment, of the gene represented by the preceeding three digit number, used to 
produce an antigenic polypeptide. A number 4 represents an antigenic polypeptide 
fragment , of the gene represented by the preceeding three digit number, used to 
stimulate an immune response or as a vaccine. The nucleotide and amino acid 

25 sequences of each gene and fragment are also shown in the Sequence Listing under the 
SEQ ID NO listed in Table 1. 

Explanation of Table 2 

Table 2 lists accession numbers for the closest matching sequences between 
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the polypeptides of the present invention and those available through GenBank and 
Derwent databases. These reference numbers are the database entry numbers 
commonly used by those of skill in the art, who will be familar with their 
denominations. The descriptions of the numenclature for GenBank are available from 
5 the National Center for Biotechnology Information. Column 1 lists the gene or ORF 
of the present invention. Column 2 lists the accession number of a "match" gene 
sequence in GenBank or Derwent databases. Column 3 lists the description of the 
"match" gene sequence. Columns 4 and 5 are the high score and smallest sum 
probability, respectively, calculated by BLAST. Polypeptides of the present 
10 invention that do not share significant identity/similarity with any polypeptide 

sequences of GenBank and Derwent are not represented in Table 2. Polypeptides of 
the present invention that share significant identity/similarity with more than one of 
the polypeptides of GenBank and Derwent are represented more than once. 

1 5 Explanation of Table 3. 

The E. faecalis polypeptides of the present invention may include one or more 
conservative amino acid substitutions from natural mutations or human manipulation 
as indicated in Table 3. Changes are preferably of a minor nature, such as conservative 
amino acid substitutions that do not significantly affect the folding or activity of the 

20 protein. Residues from the following groups, as indicated in Table 3, may be 

substituted for one another: Aromatic, Hydrophobic, Polar, Basic, Acidic, and Small, 

Explanation of Table 4 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope- 
25 bearing fragments present in each of the full length E. faecalis polypeptides described 
in Table 1 as predicted by the inventors using the algorithm of Jameson and Wolf, 
(1988) Comp. Appl. Biosci. 4:181-186. The Jameson-Wolf antigenic analysis was 
performed using the computer program PROTEAN (Version 3:1 1 for the Power 
Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, WI). E. faecalis 
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polypeptide shown in Table 1 may one or more antigenic epitopes comprising 
residues described in Table 4. It will be appreciated that depending on the analytical 
criteria used to predict antigenic determinants, the exact address of the determinant 
may vary slightly. The residues and locations shown described in Table 4 correspond 
5 to the amino acid sequences for each full length gene sequence shown in Table 1 and in 
the Sequence Listing. Polypeptides of the present invention that do not have 
antigenic epitopes recognized by the Jameson-Wolf algorithm are not represented in 
Table 2. 

1 0 Selection of Nucleic Acid Sequences Encoding Antigenic E. faecalis Polypeptides 

Sequenced E, faecalis genomic DNA was obtained from the E. faecalis strain 
V586. The E. faecalis strain V586 was deposited 2 May 1997 at the ATCC, 10801 
University Blvd. Manassas, VA 201 10-2209, and given accession number 55969. 
Some ORFs contained in the subset of fragments of the E. faecalis genome 

15 disclosed herein were derived through the use of a number of screening criteria detailed 
below. The ORFs are bounded at the amino terminus by a methionine or valine 
residue and usually at the carboxy terminus by a stop codon. 

Most of the selected sequences consist of complete ORFs. The polypeptides 
that do not comprise a complete ORF can be determined by determining whether the 

20 corresponding polynucleotide sequence comprises a stop codon after the codon for 
the last amino acid residue in the polypeptide sequence. It is not always preferred to 
express a complete ORF in a heterologous system. It may be challenging to express 
and purify a highly hydrophobic protein by common laboratory methods. Some of 
the polypeptide vaccine candidates described herein have been modified slightly to 

25 simplify the production of recombinant protein. For example, nucleotide sequences 
which encode highly hydrophobic domains, such as those found at the amino terminal 
signal sequence, have been excluded from some constructs used for expression of the 
polypeptides. Furthermore, any highly hydrophobic amino acid sequences occurring 
at the carboxy terminus have also been excluded from the recombinant expression 
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constructs. Thus, in one embodiment, a polypeptide which represents a truncated or 
modified ORF may be used as an antigen. 

While numerous methods are known in the art for selecting potentially 
immunogenic polypeptides, many of the ORFs disclosed herein were selected on the 
5 basis of screening Enterococcus faecalis ORFs for several aspects of potential 
imrnunogenicity. One set of selection criteria are as follows: 

1 . Type I signal sequence: An amino terminal type 1 signal sequence generally 
directs a nascent protein across the plasma and outer membranes to the exterior of the 
bacterial cell. Experimental evidence obtained from studies with Escherichia coli 

10 suggests that the typical type I signal sequence consists of the following biochemical 
and physical attributes (Izard, J. W. and Kendall, D, A. Mol Microbiol 13:165-112 
(1994)). The length of the type I signal sequence is approximately 15 to 25 primarily 
hydrophobic amino acid residues with a net positive charge in the extreme amino 
terminus. In addition, the central region of the signal sequence adopts an alpha-helical 

15 conformation in a hydrophobic environment. Finally, the region surrounding the 

actual site of cleavage is ideally six residues long, with small side-chain amino acids in 
the - 1 and -3 positions. 

2. Type IV signal sequence: The type IV signal sequence is an example of the 
several types of functional signal sequences which exist in addition to the type I signal 

20 sequence detailed above. Although functionally related, the type IV signal sequence 
possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, 
S.,J.Bacteriol. 774:7345-7351 (1992)). These are typically six to eight amino acids 
with a net basic charge followed by an additional sixteen to thirty primarily 
hydrophobic residues. The cleavage site of a type IV signal sequence is typically after 

25 the initial six to eight amino acids at the extreme amino terminus. In addition, type IV 
signal sequences generally contain a phenylalanine residue at the +1 site relative to the 
cleavage site. 

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial 
lipoprotein precursors has allowed the definition of a consensus amino acid sequence 
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for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors 
examined contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to 
the point of cleavage (Hayashi, S. and Wu, H. C, J. Bioenerg. Biomembr. 22:451-471 
(1990)). 

5 4. LPXTG motif: It has been experimentally determined that most anchored 

proteins found on the surface of gram-positive bacteria possess a highly conserved 
carboxy terminal sequence. More than fifty such proteins from organisms such as 5. 
pyogenes, S. mutans, E, faecalis, S. pneumoniae, and others, have been identified based 
on their extracellular location and carboxy terminal amino acid sequence (Fischetti, V. 

10 A., ASM News 62:405-410 (1996)). The conserved region consists of six charged 
amino acids at the extreme carboxy terminus coupled to 15-20 hydrophobic amino 
acids presumed to function as a transmembrane domain. Immediately adjacent to the 
transmembrane domain is a six amino acid sequence conserved in nearly all proteins 
examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any 

15 amino acid. 

An algorithm for selecting antigenic and immunogenic Enterococcus faecalis 
polypeptides including the foregoing criteria was developed. The algorithm is similar 
to that described in U.S. patent application 08/781,986, filed January 3, 1997, which 
is fully incorporated by reference herein. Use of the algorithm by the inventors to 
20 select immunologically useful Enterococcus faecalis polypeptides resulted in the 
selection of a number of the disclosed ORFs. Polypeptides comprising the 
polypeptides identified in this group may be produced by techniques standard in the 
art and as further described herein. 

25 Nucleic A cid Molecules 

Sequenced E. faecalis genomic DNA was obtained from the E. faecalis strainV586. As 
discussed elsewhere hererin, polynucleotides of the present invention readily may be 
obtained by routine application of well known and standard procedures for cloning 
and sequencing DNA. Detailed methods for obtaining libraries and for sequencing are 
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provided below, for instance. A wide variety of Enterococcus faecalis strains that can 
be used to prepare E. faecalis genomic DNA for cloning and for obtaining 
polynucleotides and polypeptides of the present invention. A wide variety of 
Enterococcus faecalis strains are available to the public from recognized depository 

5 institutions, such as the American Type Culture Collection (ATCC). It is recognized 
that minor variation is the nucleic acid and amino acid sequence may be expected from 
E faecalis strain to strain. The present invention provides for genes, including both 
polynucleotides and polypeptides, of the of the present invention from all the 
Enterococcus faecalis strains. 

10 Unless otherwise indicated, all nucleotide sequences determined by sequencing 

a DNA molecule herein were determined using an automated DNA sequencer (such as 
the Model 373 from Applied Biosystems, Inc., Foster City, CA), and all amino acid 
sequences of polypeptides encoded by DNA molecules determined herein were 
predicted by translation of a DNA sequence determined as above. Therefore, as is 

15 known in the art for any DNA sequence determined by this automated approach, any 
nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, more 
typically at least about 95% to at least about 99.9% identical to the actual nucleotide 
sequence of the sequenced DNA molecule. The actual sequence can be more 

20 precisely determined by other approaches including manual DNA sequencing methods 
well known in the art. As is also known in the art, a single insertion or deletion in a 
determined nucleotide sequence compared to the actual sequence will cause a frame 
shift in translation of the nucleotide sequence such that the predicted amino acid 
sequence encoded by a determined nucleotide sequence will be completely different 

25 from the amino acid sequence actually encoded by the sequenced DNA molecule, 
beginning at the point of such an insertion or deletion. In case of conflict between 
Table 1 and either the nucleic acid sequence of the clones listed in Table 1 or the amino 
acid sequence of the protein expressed by the clones listed in Table 1, the clones listed 
in Table 1 are controlling. By "nucleotide sequence" of a nucleic acid molecule or 
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polynucleotide is intended to mean either a DNA or RNA sequence.Using the 
information provided herein, such as the nucleotide sequence in Table 1 , a nucleic acid 
molecule of the present invention encoding a E. faecalis polypeptide may be obtained 
using standard cloning and screening procedures, such as those for cloning DNAs 

5 using genomic DNA as starting material. See, e.g., Sambrook et al. MOLECULAR 
CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 
1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY 
(John Wiley and Sons, N.Y. 1989). Illustrative of the invention, the nucleic acid 
molecule described in Table 1 was discovered in a DNA library derived from a E. 

1 0 faecalis genomic DNA. 

Nucleic acid molecules of the present invention may be in the form of RNA, 
such as mRNA, or in the form of DNA, including, for instance, DNA and genomic 
DNA obtained by cloning or produced synthetically. The DNA may be 
double-stranded or single-stranded. Single-stranded DNA or RNA may be the coding 

15 strand, also known as the sense strand, or it may be the non-coding strand, also 
referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. This includes 
segments of DNA comprising the E. faecalis polynucleotides of the present invention 

20 isolated from the native chromosome. These fragments include both isolated 

fragments consisting only of E. faecalis DNA and fragments comprising heterologous 
sequences such as vector sequences or other foreign DNA. For example, recombinant 
DNA molecules contained in a vector are considered isolated for the purposes of the 
present invention. Further examples of isolated DNA molecules include recombinant 

25 DNA molecules maintained in heterologous host cells or purified (partially or 

substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or 
in vitro RNA transcripts of the DNA molecules of the present invention. Isolated 
nucleic acid molecules according to the present invention further include such 
molecules produced synthetically. 
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In addition, isolated nucleic acid molecules of the invention include DNA 
molecules which comprise a sequence substantially different from those described 
above but which, due to the degeneracy of the genetic code, still encode a E.faecalis 
polypeptides and peptides of the present invention (e.g. polypeptides of Table 1). 
5 That is, all possible DNA sequences that encode the E.faecalis polypeptides of the 
present invention. This includes the genetic code and species-specific codon 
preferences known in the art. Thus, it would be routine for one skilled in the art to 
generate the degenerate variants described above, for instance, to optimize codon 
expression for a particular host (e.g., change codons in the bacteria mRNA to those 

10 preferred by a mammalian or other bacterial host such as E. coli). 

The invention further provides isolated nucleic acid molecules having the 
nucleotide sequence shown in Table 1 or a nucleic acid molecule having a sequence 
complementary to one of the above sequences. Such isolated molecules, particularly 
DNA molecules, are useful as probes for gene mapping and for identifying E.faecalis 

15 in a biological sample, for instance, by PCR, Southern blot, Northern blot, or other 
form of hybridization analysis. 

The present invention is further directed to nucleic acid molecules encoding 
portions or fragments of the nucleotide sequences described herein. Fragments include 
portions of the nucleotide sequences of Table 1, or the E.faecalis nucleotide 

20 sequences contained in the plasimd clones listed in Table 1, at least 10 contiguous 
nucleotides in length selected from any two integers, one of which representing a 5' 
nucleotide position and a second of which representing a 3' nucleotide position, where 
the first nucleotide for each nucleotide sequence in Table 1 is position 1 . That is, 
every combination of a 5' and 3' nucleotide position that a fragment at least 10 

25 contiguous nucleotides in length could occupy is included in the invention. At least 
means a fragment may be 10 contiguous nucleotide bases in length or any integer 
between 10 and the length of an entire nucleotide sequence of Table 1 minus 1. 
Therefore, included in the invention are contiguous fragments specified by any 5' and 
3' nucleotide base positions of a nucleotide sequences of Table 1 wherein the 
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contiguous fragment is any integer between 10 and the length of an entire nucleotide 
sequence minus 1 . 

Further, the invention includes polynucleotides comprising fragments specified 
by size, in nucleotides, rather than by nucleotide positions. The invention includes 
5 any fragment size, in contiguous nucleotides, selected from integers between 1 0 and 
the length of an entire nucleotide sequence minus 1 . Preferred sizes of contiguous 
nucleotide fragments include 20 nucleotides, 30 nucleotides, 40 nucleotides, 50 
nucleotides. Other preferred sizes of contiguous nucleotide fragments, which may be 
useful as diagnostic probes and primers, include fragments 50-300 nucleotides in 

10 length which include, as discussed above, fragment sizes representing each integer 
between 50-300. Larger fragments are also useful according to the present invention 
corresponding to most, if not all, of the nucleotide sequences shown in Table lor of 
the E.faecalis nucleotide sequences of the plasimd clones listed in Table 1. The 
preferred sizes are, of course, meant to exemplify not limit the present invention as all 

15 size fragments, representing any integer between 10 and the length of an entire 
nucleotide sequence minus 1, are included in the invention. Additional preferred 
nucleic acid fragments of the present invention include nucleic acid molecules encoding 
epitope-bearing portions of E, faecalis polypeptides identified in Table 4. 

The present invention also provides for the exclusion of any fragment, 

20 specified by 5 1 and 3* base positions or by size in nucleotide bases as described above 
for any nucleotide sequence of Table 1 or the plasimd clones listed in Table 1 . Any 
number of fragments of nucleotide sequences in Table 1 or the plasimd clones listed in 
Table 1, specified by 5 1 and 3' base positions or by size in nucleotides, as described 
above, may be excluded from the present invention. 

25 In another aspect, the invention provides an isolated nucleic acid molecule 

comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a portion of a polynucleotide in a nucleic acid molecules of the invention 
described above, for instance, nucleotide sequences of Table 1 or the E.faecalis 
sequences of the plasimd clones listed in Table 1 . By "stringent hybridization 
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conditions" is intended overnight incubation at 42°C in a solution comprising: 50% 
formamide, 5x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 |ig/ml 
denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1 x SSC at 
5 about 65°C. 

By a polynucleotide which hybridizes to a "portion" of a polynucleotide is 
intended a polynucleotide (either DNA or RNA) hybridizing to at least about 15 
nucleotides bases, and more preferably at least about 20 nucleotides bases, still more 
preferably at least about 30 nucleotides bases, and even more preferably about 30-70 

10 (e.g., 50) nucleotides bases of the reference polynucleotide. These are useful as 

diagnostic probes and primers as discussed above. By a portion of a polynucleotide 
of "at least 20 nucleotides bases in length," for example, is intended 20 or more 
contiguous nucleotides bases nucleotides from the nucleotide sequence of the reference 
polynucleotide (e.g., the nucleotide sequence as shown in Table 1). Portions of a 

15 polynucleotide which hybridizes to a nucleotide sequence in Table 1, which can be 
used as probes and primers, may also be precisely specified by 5' and V base 
positions or by size in nucleotide bases as described above or precisely excluded in the 
same manner. 

The nucleic acid molecules of the present invention include those encoding the 
20 full length E.faecalis polypeptides of Table 1 and portions of the E.faecalis 
polypeptides of Table 1 . Also included in the present invention are nucleic acids 
encoding the above full length sequences and further comprise additional sequences, 
such as those encoding an added secretory leader sequence, such as a pre-, or pro- or 
prepro- protein sequence. Further included in the present invention are nucleic acids 
25 encoding the above full length sequences and portions thereof and further comprise 
additional heterologous amino acid sequences encoded by nucleic acid sequences from 
a different source. 

Also included in the present invention are nucleic acids encoding the above 
protein sequences together with additional, non-coding sequences, including for 
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example, but not limited to non-coding 5' and 3' sequences. These sequences include 
transcribed, non-translated sequences that may play a role in transcription, and 
mRNA processing, for example, ribosome binding and stability of mRNA. Also 
included in the present invention are additional coding sequences which provide 

5 additional functionalities. 

Thus, a nucleotide sequence encoding a polypeptide may be fused to a marker 
sequence, such as a sequence encoding a peptide which facilitates purification of the 
fused polypeptide. In certain preferred embodiments of this aspect of the invention, 
the marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in 

10 a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among 
others, many of which are commercially available. For instance, hexa-histidine 
provides for convenient purification of the fusion protein. See Gentz et al. (1989) 
Proc. Natl. Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for 
purification which corresponds to an epitope derived from the influenza hemagglutinin 

15 protein. See Wilson et al. (1984) Cell 37:767. As discussed below, other such fusion 
proteins include the E. faecalis polypeptides of the present invention fused to Fc at 
the N- or C-terminus. 

Variant and Mutant Polynucleotides 
20 The present invention further relates to variants of the nucleic acid molecules 

which encode portions, analogs or derivatives of zE. faecalis polypeptides of Table 1 

and variant polypeptides thereof including portions, analogs, and derivatives of the E. 

faecalis polypeptides. Variants may occur naturally, such as a natural allelic variant. 

By an "allelic variant" is intended one of several alternate forms of a gene occupying a 
25 given locus on a chromosome of an organism. See, e.g., B. Lewin, Genes IV (1 990). 

Non-naturally occurring variants may be produced using art-known mutagenesis 

techniques. 

Such nucleic acid variants include those produced by nucleotide substitutions, 
deletions, or additions. The substitutions, deletions, or additions may involve one or 
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more nucleotides. The variants may be altered in coding regions, non-coding regions, 
or both. Alterations in the coding regions may produce conservative or 
non-conservative amino acid substitutions, deletions or additions. Especially 
preferred among these are silent substitutions, additions and deletions, which do not 
5 alter the properties and activities of a E.faecalis protein of the present invention or 
portions thereof. Also especially preferred in this regard are conservative 
substitutions. 

Such polypeptide variants include those produced by amino acid 
substitutions, deletions or additions. The substitutions, deletions, or additions may 

1 0 involve one or more residues. Alterations may produce conservative or 

non-conservative amino acid substitutions, deletions, or additions. Especially 
preferred among these are silent substitutions, additions and deletions, which do not 
alter the properties and activities of a E. faecalis protein of the present invention or 
portions thereof. Also especially preferred in this regard are conservative 

15 substitutions. 

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing 
the recombinant vectors, as well as to methods of making such vectors and host cells 
and for using them for production of E. faecalis polypeptides or peptides by 

20 recombinant techniques. 

The present application is directed to nucleic acid molecules at least 90%, 
95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1 . 
The above nucleic acid sequences are included irrespective of whether they encode a 
polypeptide having E.faecalis activity. This is because even where a particular 

25 nucleic acid molecule does not encode a polypeptide having E. faecalis activity, one of 
skill in the art would still know how to use the nucleic acid molecule, for instance, as a 
hybridization probe. Uses of the nucleic acid molecules of the present invention that 
do not encode a polypeptide having E.faecalis activity include, inter alia, isolating an 
E.faecalis gene or allelic variants thereof from a DNA library, and detecting E.faecalis 
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mRNA expression samples, environmental samples, suspected of containing E. 
faecalis by Northern Blot analysis. 

Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 
97%, 98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, 
5 in fact, encode a polypeptide having E, faecalis protein activity By M a polypeptide 
having E. faecalis activity" is intended polypeptides exhibiting activity similar, but 
not necessarily identical, to an activity of the E. faecalis protein of the invention, as 
measured in a particular biological assay suitable for measuring activity of the 
specified protein. 

10 Due to the degeneracy of the genetic code, one of ordinary skill in the art will 

immediately recognize that a large number of the nucleic acid molecules having a 
sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid 
sequences shown in Table 1 will encode a polypeptide having E. faecalis protein 
activity. In fact, since degenerate variants of these nucleotide sequences all encode the 

15 same polypeptide, this will be clear to the skilled artisan even without performing the 
above described comparison assay. It will be further recognized in the art that, for 
such nucleic acid molecules that are not degenerate variants, a reasonable number will 
also encode a polypeptide having E. faecalis protein activity. This is because the 
skilled artisan is fully aware of amino acid substitutions that are either less likely or 

20 not likely to significantly effect protein function (e.g., replacing one aliphatic amino 
acid with a second aliphatic amino acid), as further described below. 

The biological activity or function of the polypeptides of the present 
invention are expected to be similar or identical to polypeptides from other bacteria 
that share a high degree of structural identity/similarity. Tables 2 lists accession 

25 numbers and descriptions for the closest matching sequences of polypeptides 

available through Genbank and Derwent databases. It is therefore expected that the 
biological activity or function of the polypeptides of the present invention will be 
similar or identical to those polypeptides from other bacterial genuses, species, or 
strains listed in Table 2. 
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By a polynucleotide having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended 
that the nucleotide sequence of the polynucleotide is identical to the reference 
sequence except that the polynucleotide sequence may include up to five point 
5 mutations per each 1 00 nucleotides of the reference nucleotide sequence encoding the 
E. faecalis polypeptide. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 
5% of the nucleotides in the reference sequence may be deleted, inserted, or 
substituted with another nucleotide. The query sequence may be an entire sequence 
10 shown in Table 1 , the ORF (open reading frame), or any fragment specified as 
described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 

15 computer programs. A preferred method for determining the best overall match 
between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using the 
FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et 
al. (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and 

20 subject sequences are both DNA sequences. An RNA sequence can be compared by 
first converting U's to T's. The result of said global sequence alignment is in percent 
identity. Preferred parameters used in a FASTDB alignment of DNA sequences to 
calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, 
Joining Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap 

25 Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject 
nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3' 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3' 
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truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent 
identity is corrected by calculating the number of bases of the query sequence that are 
5' and 3' of the subject sequence, which are not matched/aligned, as a percent of the 
5 total bases of the query sequence. Whether a nucleotide is matched/aligned is 

determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected 
score is what is used for the purposes of the present invention. Only nucleotides 

10 outside the 5' and 3' nucleotides of the subject sequence, as displayed by the 
FASTDB alignment, which are not matched/aligned with the query sequence, are 
calculated for the purposes of manually adjusting the percent identity score. 

For example, a 90 nucleotide subject sequence is aligned to a 1 00 nucleotide 
query sequence to determine percent identity. The deletions occur at the 5' end of the 

1 5 subject sequence and therefore, the FASTDB alignment does not show a 

matched/alignment of the first 1 0 nucleotides at 5' end. The 10 unpaired nucleotides 
represent 10% of the sequence (number of nucleotides at the 5' and 3' ends not 
matched/total number of nucleotides in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 90 

20 nucleotides were perfectly matched the final percent identity would be 90%. In 

another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide 
query sequence. This time the deletions arc internal deletions so that there are no 
nucleotides on the 5' or 3* of the subject sequence which are not matched/aligned with 
the query. In this case the percent identity calculated by FASTDB is not manually 

25 corrected. Once again, only nucleotides 5' and 3' of the subject sequence which are 
not matched/aligned with the query sequence are manually corrected for. No other 
manual corrections are to made for the purposes of the present invention. 

Vectors and Host Cell 
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The present invention also relates to vectors which include the isolated DMA 
molecules of the present invention, host cells comprising the recombinant vectors, and 
the production of E. faecalis polypeptides and peptides of the present invention 
expressed by the host cells. 
5 Recombinant constructs may be introduced into host cells using well known 

techniques such as infection, transduction, transfection, transvection, electroporation 
and transformation. The vector may be, for example, a phage, plasmid, viral or 
retroviral vector. Retroviral vectors may be replication competent or replication 
defective. In the latter case, viral propagation generally will occur only in 
1 0 complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 
15 and then transduced into host cells. 

Preferred are vectors comprising cw-acting control regions to the 
polynucleotide of interest. Appropriate fra^s-acting factors may be supplied by the 
host, supplied by a complementing vector or supplied by the vector itself upon 
introduction into the host. 
20 In certain preferred embodiments in this regard, the vectors provide for 

specific expression, which may be inducible and/or cell type-specific. Particularly 
preferred among such vectors are those inducible by environmental factors that are 
easy to manipulate, such as temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-, 
25 episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids, 
bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as 
baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, 
such as cosmids and phagemids. 
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The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
5 constructs will further contain sites for transcription initiation, termination and, in the 
transcribed region, a ribosome binding site for translation. The coding portion of the 
mature transcripts expressed by the constructs will preferably include a translation 
initiating site at the beginning and a termination codon (UAA, UGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

10 As indicated, the expression vectors will preferably include at least one 

selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin, or ampicillin 
resistance genes for culturing in E. coli and other bacteria. Representative examples of 
appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 

15 Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera S19 cells; animal cells such as CHO, COS 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE9, 

20 pQEl 0 available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, 
pNH8A, pNH16a, pNH18A, pNH46A available from Stratagene; pET series of 
vectors available from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540, 
pRIT5 available from Pharmacia. Among preferred eukaryotic vectors are pWLNEO, 
pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and pSVK3, pBPV, 

25 pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily 
apparent to the skilled artisan. 

Among known bacterial promoters suitable for use in the present invention 
include the E. coli lac\ and lacZ promoters, the T3, T5 and T7 promoters, the gpt 
promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic 
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promoters include the CMV immediate early promoter, the HSV thymidine kinase 
promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such 
as those of the Rous sarcoma virus (RSV), and metallothionein promoters, such as the 
mouse metallothionein-1 promoter. 
5 Introduction of the construct into the host cell can be effected by calcium 

phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection or other methods. Such methods 
are described in many standard laboratory manuals (for example, Davis, et ai, Basic 
Methods In Molecular Biology (1986)). 

10 Transcription of DNA encoding the polypeptides of the present invention by 

higher eukaryotes may be increased by inserting an enhancer sequence into the vector. 
Enhancers are exacting elements of DNA, usually about from 10 to 300 nucleotides 
that act to increase transcriptional activity of a promoter in a given host cell-type. 
Examples of enhancers include the SV40 enhancer, which is located on the late side of 

15 the replication origin at nucleotides 100 to 270, the cytomegalovirus early promoter 
enhancer, the polyoma enhancer on the late side of the replication origin, and 
adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the extracellular environment, 

20 appropriate secretion signals may be incorporated into the expressed polypeptide, for 
example, the amino acid sequence KDEL. The signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional heterologous 

25 functional regions. For instance, a region of additional amino acids, particularly 

charged amino acids, may be added to the N-terminus of the polypeptide to improve 
stability and persistence in the host cell, during purification, or during subsequent 
handling and storage. Also, peptide moieties may be added to the polypeptide to 
facilitate purification. Such regions may be removed prior to final preparation of the 
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polypeptide. The addition of peptide moieties to polypeptides to engender secretion 
or excretion, to improve stability and to facilitate purification, among others, are 
familiar and routine techniques in the art. A preferred fusion protein comprises a 
heterologous region from immunoglobulin that is useful to solubilize proteins. For 
5 example, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion proteins 
comprising various portions of constant region of immunoglobulin molecules together 
with another human protein or part thereof. In many cases, the Fc part in a fusion 
protein is thoroughly advantageous for use in therapy and diagnosis and thus results, 
for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 

10 hand, for some uses it would be desirable to be able to delete the Fc part after the 
fusion protein has been expressed, detected and purified in the advantageous manner 
described. This is the case when Fc portion proves to be a hindrance to use in 
therapy and diagnosis, for example when the fusion protein is to be used as antigen for 
immunizations. In drug discovery, for example, human proteins, such as, 

15 hlL5-receptor has been fused with Fc portions for the purpose of high-throughput 
screening assays to identify antagonists of hIL-5. See Bennett, D. et al. (1995) J. 
Molec. Recogn. 8:52-58 and Johanson, K. et al (1995) J. Biol Chem. 270 
(16):9459-9471. 

The E.faecalis polypeptides can be recovered and purified from recombinant 
20 cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxy! apatite chromatography, lectin chromatography and high 
performance liquid chromatography ("HPLC") is employed for purification. 
25 Polypeptides of the present invention include naturally purified products, products of 
chemical synthetic procedures, and products produced by recombinant techniques 
from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. 
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Polypeptides and Fragments 

The invention further provides an isolated E. faecalis polypeptide having an 
amino acid sequence in Table 1 , or a peptide or polypeptide comprising a portion of 
the above polypeptides. 

5 

Variant and Mutant Polypeptides 

To improve or alter the characteristics of E. faecalis polypeptides of the 
present invention, protein engineering may be employed. Recombinant DNA 
technology known to those skilled in the art can be used to create novel mutant 
10 proteins or muteins including single or multiple amino acid substitutions, deletions, 
additions, or fusion proteins. Such modified polypeptides can show, e.g., enhanced 
activity or increased stability. In addition, they may be purified in higher yields and 
show better solubility than the corresponding natural polypeptide, at least under ' 
certain purification and storage conditions. 

15 

N-Terminal and C-Terminal Deletion Mutants 

It is known in the art that one or more amino acids may be deleted from the 
N-terminus or C-terminus without substantial loss of biological function. For 
instance, Ron et al. J. Biol. Chem., 268:2984-2988 (1993), reported modified KGF 

20 proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid 
residues were missing. Accordingly, the present invention provides polypeptides 
having one or more residues deleted from the amino terminus of the amino acid 
sequence of the E. faecalis polypeptides shown in Table 1 , and polynucleotides 
encoding such polypeptides. 

25 Similarly, many examples of biologically functional C-terminal deletion 

muteins are known. For instance, Interferon gamma shows up to ten times higher 
activities by deleting 8-10 amino acid residues from the carboxy terminus of the 
protein See, e.g., Dobeli, et al. (1 988) J. Biotechnology 7: 1 99-2 1 6. Accordingly, the 
present invention provides polypeptides having one or more residues from the 
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carboxy terminus of the amino acid sequence of the E.faecalis polypeptides shown in 
Table 1 . The invention also provides polypeptides having one or more amino acids 
deleted from both the amino and the carboxyl termini as described below. 

The present invention is further directed to polynucleotide encoding portions 
5 or fragments of the amino acid sequences described herein as well as to portions or 
fragments of the isolated amino acid sequences described herein. Fragments include 
portions of the amino acid sequences of Table 1 , are at least 5 contiguous amino acid 
in length, are selected from any two integers, one of which representing a N-terminal 
position. The initiation codon of the polypeptides of the present inventions position 

10 1 . Every combination of a N-terminal and C-terminal position that a fragment at least 
5 contiguous amino acid residues in length could occupy, on any given amino acid 
sequence of Table 1 is included in the invention. At least means a fragment may be 5 
contiguous amino acid residues in length or any integer between 5 and the number of 
residues in a full length amino acid sequence minus 1. Therefore, included in the 

15 invention are contiguous fragments specified by any N-terminal and C-terminal 

positions of amino acid sequence set forth in Table 1 wherein the contiguous fragment 
is any integer between 5 and the number of residues in a full length sequence minus 1 . 

Further, the invention includes polypeptides comprising fragments specified 
by size, in amino acid residues, rather than by N-terminal and C-terminal positions. 

20 The invention includes any fragment size, in contiguous amino acid residues, selected 
from integers between 5 and the number of residues in a full length sequence minus 1 . 
Preferred sizes of contiguous polypeptide fragments include about 5 amino acid 
residues, about 10 amino acid residues, about 20 amino acid residues, about 30 amino 
acid residues, about 40 amino acid residues, about 50 amino acid residues, about 100 

25 amino acid residues, about 200 amino acid residues, about 300 amino acid residues, 
and about 400 amino acid residues. The preferred sizes are, of course, meant to 
exemplify, not limit, the present invention as all size fragments representing any 
integer between 5 and the number of residues in a full length sequence minus 1 are 
included in the invention. The present invention also provides for the exclusion of any 
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fragments specified by N-terminal and C-terminal positions or by size in amino acid 
residues as described above. Any number of fragments specified by N-terminal and 
C-terminal positions or by size in amino acid residues as described above may be 
excluded. 

5 The above fragments need not be active since they would be useful, for 

example, in immunoassays, in epitope mapping, epitope tagging, to generate 
antibodies to a particular portion of the protein, as vaccines, and as molecular weight 
markers. 

10 Other Mutants 

In addition to N- and C-terminal deletion forms of the protein discussed above, 
it also will be recognized by one of ordinary skill in the art that some amino acid 
sequences of the E.faecalis polypeptide can be varied without significant effect of the 
structure or function of the protein. If such differences in sequence are contemplated, 

15 it should be remembered that there will be critical areas on the protein which 
determine activity. 

Thus, the invention further includes variations of the E.faecalis polypeptides 
which show substantial E.faecalis polypeptide activity or which include regions of E. 
faecalis protein such as the protein portions discussed below. Such mutants include 

20 deletions, insertions, inversions, repeats, and type substitutions selected according to 
general rules known in the art so as to have little effect on activity. For example, 
guidance concerning how to make phenotypically silent amino acid substitutions is 
provided. There are two main approaches for studying the tolerance of an amino acid 
sequence to change. See, Bowie, J. U. et al (1990), Science 247:1306-1310. The first 

25 method relies on the process of evolution, in which mutations are either accepted or 
rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections or 
screens to identify sequences that maintain functionality. 

These studies have revealed that proteins are surprisingly tolerant of amino 
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acid substitutions. The studies indicate which amino acid changes are likely to be 
permissive at a certain position of the protein. For example, most buried amino acid 
residues require nonpolar side chains, whereas few features of surface side chains are 
generally conserved. Other such phenotypically silent substitutions are described by 
5 Bowie et al. {supra) and the references cited therein. Typically seen as conservative 
substitutions are the replacements, one for another, among the aliphatic amino acids 
Ala, Val, Leu and He; interchange of the hydroxyl residues Ser and Thr, exchange of 
the acidic residues Asp and Glu, substitution between the amide residues Asn and 
Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic 

]0 residues Phe, Tyr. 

Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 
1 , or that encoded by the plaimds listed in Table 1 , may be: (i) one in which one or 
more of the amino acid residues are substituted with a conserved or non-conserved 
amino acid residue (preferably a conserved amino acid residue) and such substituted 

1 5 amino acid residue may or may not be one encoded by the genetic code: or (ii) one in 
which one or more of the amino acid residues includes a substituent group: or (iii) one 
in which the E.faecalis polypeptide is fused with another compound, such as a 
compound to increase the half-life of the polypeptide (for example, polyethylene 
glycol): or (iv) one in which the additional amino acids are fused to the above form of 

20 the polypeptide, such as an lgG Fc fusion region peptide or leader or secretory 

sequence or a sequence which is employed for purification of the above form of the 
polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are 
deemed to be within the scope of those skilled in the art from the teachings herein. 

Thus, the E. faecalis polypeptides of the present invention may include one or 

25 more amino acid substitutions, deletions, or additions, either from natural mutations or 
human manipulation. As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding or 
activity of the protein (see Table 3). 

Amino acids in the E.faecalis proteins of the present invention that are 
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essential for function can be identified by methods known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis. See, e.g., Cunningham et al. 
(1989) Science 244:1081-1085. The latter procedure introduces single alanine 
mutations at every residue in the molecule. The resulting mutant molecules are then 
5 tested for biological activity using assays appropriate for measuring the function of 
the particular protein. 

Of special interest are substitutions of charged amino acids with other charged 
or neutral amino acids which may produce proteins with highly desirable improved 
characteristics, such as less aggregation. Aggregation may not only reduce activity but 

10 also be problematic when preparing pharmaceutical formulations, because aggregates 
can be immunogenic. See, e.g., Pinckard et al., (1967) Clin. Exp. Immunol. 2:331-340; 
Robbins, et al., (1987) Diabetes 36:838-845; Cleland, et al., (1993) Crit. Rev. 
Therapeutic Drug Carrier Systems 10:307-377. 

The polypeptides of the present invention are preferably provided in an 

1 5 isolated form, and preferably are substantially purified. A recombinantly produced 
version of the E. faecalis polypeptide can be substantially purified by the one-step 
method described by Smith et al. (1988) Gene 67:31-40. Polypeptides of the 
invention also can be purified from natural or recombinant sources using antibodies 
directed against the polypeptides of the invention in methods which are well known in 

20 the art of protein purification. 

The invention further provides for isolated E. faecalis polypeptides 
comprising an amino acid sequence selected from the group consisting of: (a) the 
amino acid sequence of a full-length E. faecalis polypeptide having the complete 
amino acid sequence shown in Table 1 ; (b) the amino acid sequence of a full-length E. 

25 faecalis polypeptide having the complete amino acid sequence shown in Table 1 
excepting the N-terminal methionine; (c) the complete amino acid sequence encoded 
by the plaimds listed in Table 1; and (d) the complete amino acid sequence excepting 
the N-terminal methionine encoded by the plaimds listed in Table 1 . The 
polypeptides of the present invention also include polypeptides having an amino acid 
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sequence at least 80% identical, more preferably at least 90% identical, and still more 
preferably 95%, 96%, 97%, 98% or 99% identical to those described in (a), (b), (c), 
and (d) above. 

Further polypeptides of the present invention include polypeptides which 
5 have at least 90% similarity, more preferably at least 95% similarity, and still more 
preferably at least 96%, 97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which 
comprises the amino acid sequence of a E.faecalis polypeptide having an amino acid 
sequence which contains at least one conservative amino acid substitution, but not 

10 more than 50 conservative amino acid substitutions, not more than 40 conservative 
amino acid substitutions, not more than 30 conservative amino acid substitutions, and 
not more than 20 conservative amino acid substitutions. Also provided are 
polypeptides which comprise the amino acid sequence of a E.faecalis polypeptide, 
having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino 

15 acid substitutions. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 

20 alterations per each 100 amino acids of the query amino acid sequence. In other 

words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a query amino acid sequence, up to 5% of the amino acid residues in the subject 
sequence may be inserted, deleted, (indels) or substituted with another amino acid. 
These alterations of the reference sequence may occur at the amino or carboxy 

25 terminal positions of the reference amino acid sequence or anywhere between those 
terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 
95%, 96%, 97%>, 98% or 99% identical to, for instance, the amino acid sequences 
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shown in Table 1 or to the amino acid sequence encoded by the plaimds listed in Table 
1 can be determined conventionally using known computer programs. A preferred 
method for determining the best overall match between a query sequence (a sequence 
of the present invention) and a subject sequence, also referred to as a global sequence 
5 alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence 
alignment the query and subject sequences are both amino acid sequences. The result 
of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch 

10 Penalty=l , Joining Penal ty=20, Randomization Group Length=0, Cutoff Score=l , 
Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window 
Size=500 or the length of the subject amino acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, the results, in percent identity, 

15 must be manually corrected. This is because the FASTDB program does not account 
for N- and C-terminal truncations of the subject sequence when calculating global 
percent identity. For subject sequences truncated at the N- and C-termini, relative to 
the query sequence, the percent identity is corrected by calculating the number of 
residues of the query sequence that are N- and C-terminal of the subject sequence, 

20 which are not matched/aligned with a corresponding subject residue, as a percent of 
the total bases of the query sequence. Whether a residue is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This final percent 

25 identity score is what is used for the purposes of the present invention. Only 
residues to the N- and C-termini of the subject sequence, which are not 
matched/aligned with the query sequence, are considered for the purposes of manually 
adjusting the percent identity score. That is, only query amino acid residues outside 
the farthest N- and C-terminal residues of the subject sequence. 
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For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not 
match/align with the first 10 residues at the N-terminus. The 10 unpaired residues 

5 represent 1 0% of the sequence (number of residues at the N- and C- termini not 
matched/total number of residues in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 90 
residues were perfectly matched the final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared with a 100 residue query 

10 sequence. This time the deletions are internal so there are no residues at the N- or C- 
tennini of the subject sequence which are not matched/aligned with the query. In this 
case the percent identity calculated by FASTDB is not manually corrected. Once 
again, only residue positions outside the N- and C-terminal ends of the subject 
sequence, as displayed in the FASTDB alignment, which are not matched/aligned 

1 5 with the query sequence are manually corrected. No other manual corrections are to 
made for the purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they 
have their normal biological activity. This is because even where a particular 
polypeptide molecule does not have biological activity, one of skill in the art would 

20 still know how to use the polypeptide, for instance, as a vaccine or to generate 

antibodies. Other uses of the polypeptides of the present invention that do not have 
E.faecalis activity include, inter alia, as epitope tags, in epitope mapping, and as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods known to those of skill in the art. 

25 As described below, the polypeptides of the present invention can also be 

used to raise polyclonal and monoclonal antibodies, which are useful in assays for 
detecting E.faecalis protein expression or as agonists and antagonists capable of 
enhancing or inhibiting E.faecalis protein function. Further, such polypeptides can be 
used in the yeast two-hybrid system to "capture" E.faecalis protein binding proteins 
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which are also candidate agonists and antagonists according to the present invention. 
See, e.g., Fields et al. (1989) Nature 340:245-246. 

Epitope-Bearing Portions 
5 In another aspect, the invention provides peptides and polypeptides 

comprising epitope-bearing portions of the E. faecalis polypeptides of the present 
invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides 
of the present invention. An "immunogenic epitope" is defined as a part of a protein 
that elicits an antibody response when the whole protein or polypeptide is the 

10 immunogen. These immunogenic epitopes are believed to be confined to a few loci on 
the molecule. On the other hand, a region of a protein molecule to which an antibody 
can bind is defined as an "antigenic determinant" or "antigenic epitope." The number 
of immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, e.g., Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002. 

1 5 Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 
4 only lists amino acid residues comprising epitopes predicted to have the highest 
degree of antigenicity. The polypeptides not listed in Table 4 and portions of 
polypeptides not listed in Table 4 are not considered non-antigenic. This is because 
they may still be antigenic in vivo but merely not recognized as such by the particular 

20 algorithm used. Thus, Table 4 lists the amino acid residues comprising preferred 
antigenic epitopes but not a complete list. Amino acid residues comprising other 
anigenic epitopes may be determined by algorithms similar to the Jameson-Wolf 
analysis or by in vivo testing for an antigenic response using the methods described 
herein or those known in the art. 

25 As to the selection of peptides or polypeptides bearing an antigenic epitope 

{i.e., that contain a region of a protein molecule to which an antibody can bind), it is 
well known in that art that relatively short synthetic peptides that mimic part of a 
protein sequence are routinely capable of eliciting an antiserum that reacts with the 
partially mimicked protein. See, e.g., Sutcliffe, et al., (1983) Science 219:660-666. 
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Peptides capable of eliciting protein-reactive sera are frequently represented in the 
primary sequence of a protein, can be characterized by a set of simple chemical rules, 
and are confined neither to immunodominant regions of intact proteins (i.e., 
immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are 
5 extremely hydrophobic and those of six or fewer residues generally are ineffective at 
inducing antibodies that bind to the mimicked protein; longer, peptides, especially 
those containing proline residues, usually are effective. See, Sutcliffe, et al., supra, p. 
661. For instance, 18 of 20 peptides designed according to these guidelines, containing 
8-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HA1 

10 polypeptide chain, induced antibodies that reacted with the HA1 protein or intact 
virus; and 12/12 peptides from the MuLV polymerase and 1 8/18 from the rabies 
glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 

15 specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas 
obtained by fusion of spleen cells from donors immunized with an antigen 
epitope-bearing peptide generally secrete antibody reactive with the native protein. 
See Sutcliffe, et al., supra, p. 663. The antibodies raised by antigenic epitope-bearing 
peptides or polypeptides are useful to detect the mimicked protein, and antibodies to 

20 different peptides may be used for tracking the fate of various regions of a protein 
precursor which undergoes post-translational processing. The peptides and 
anti-peptide antibodies may be used in a variety of qualitative or quantitative assays 
for the mimicked protein, for instance in competition assays since it has been shown 
that even short peptides (e.g., about 9 amino acids) can bind and displace the larger 

25 peptides in immunoprecipitation assays. See, e.g., Wilson, et al., (1984) Cell 
37:767-778. The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography 
using methods known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention 



WO 98/50554 



-37- 



PCT/US98/08959 



designed according to the above guidelines preferably contain a sequence of at least 
seven, more preferably at least nine and most preferably between about 10 to about 
50 amino acids (i.e. any integer between 7 and 50) contained within the amino acid 
sequence of a polypeptide of the invention. However, peptides or polypeptides 

5 comprising a larger portion of an amino acid sequence of a polypeptide of the 
invention, containing about 50 to about 100 amino acids, or any length up to and 
including the entire amino acid sequence of a polypeptide of the invention, also are 
considered epitope-bearing peptides or polypeptides of the invention and also are 
useful for inducing antibodies that react with the mimicked protein. Preferably, the 

10 amino acid sequence of the epitope-bearing peptide is selected to provide substantial 
solubility in aqueous solvents (i.e., the sequence includes relatively hydrophilic 
residues and highly hydrophobic sequences are preferably avoided); and sequences 
containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used 

15 to generate an enterococcal-specific immune response or antibodies include portions of 
the amino acid sequences identified in Table 1. More specifically, Table 4 discloses a 
list of non-limiting residues that are involved in the antigenicity of the epitope-bearing 
fragments of the present invention. Therefore, the present inventions provides for 
isolatd and purified antigenic epitope-bearing fragements of the polypeptides of the 

20 present invention comprising a peptide sequences of Table 4. The antigenic epitope- 
bearing fragments comprising a peptide sequence of Table 4 preferably contain a 
sequence of at least seven, more preferably at least nine and most preferably between 
about 10 to about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide 
of the present invention. That is, included in the present invention are antigenic 

25 polypeptides between the integers of 7 and 50 amino acid in length comprising one or 
more of the sequences of Table 4. Therefore, in most cases, the polypeptides of 
Table 4 make up only a portion of the antigenic polypeptide. All combinations of 
sequences between the integers of 7 and 50 amino acid in length comprising one or 
more of the sequences of Table 4 arc included. The antigenic epitope-bearing 
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fragements may be specified by either the number of contiguous amino acid residues 
or by specific N-terminal and C-terminal positions as described above for the 
polypeptide fragements of the present invention, wherein the initiation codon is 
residue 1 . Any number of the described antigenic epitope-bearing fragements of the 
5 present invention may also be excluded from the present invention in the same 
manner. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides including 
recombinant means using nucleic acid molecules of the invention. For instance, an 

10 epitope-bearing amino acid sequence of the present invention may be fused to a larger 
polypeptide which acts as a carrier during recombinant production and purification, as 
well as during immunization to produce anti-peptide antibodies. Epitope-bearing 
peptides also may be synthesized using known methods of chemical synthesis. For 
instance, Houghten has described a simple method for synthesis of large numbers of 

15 peptides, such as 10-20 mg of 248 different 13 residue peptides representing single 
amino acid variants of a segment of the HA1 polypeptide which were prepared and 
characterized (by ELISA-type binding studies) in less than four weeks (Houghten, R. 
A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple 
Peptide Synthesis (SMPS)" process is further described in U.S. Patent No. 4,631,21 1 

20 to Houghten and coworkers (1986). In this procedure the individual resins for the 
solid-phase synthesis of various peptides are contained in separate solvent-permeable 
packets, enabling the optimal use of the many identical repetitive steps involved in 
solid-phase methods. A completely manual procedure allows 500-1000 or more 
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. 

25 Sci. 82:5131-5135 at 5134. 

Epitope-bearing peptides and polypeptides of the invention are used to induce 
antibodies according to methods well known in the art. See, e.g., Sutcliffe, et al., 
supra;; Wilson, et al., supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354. 
Generally, animals may be immunized with free peptide; however, anti-peptide 
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antibody titer may be boosted by coupling of the peptide to a macromolecular carrier, 
such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, peptides 
containing cysteine may be coupled to carrier using a linker such as 
m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may 
5 be coupled to carrier using a more general linking agent such as glutaraldehyde. 
Animals such as rabbits, rats and mice are immunized with either free or 
carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection 
of emulsions containing about 100 |ag peptide or carrier protein and Freund's adjuvant. 
Several booster injections may be needed, for instance, at intervals of about two 

10 weeks, to provide a useful titer of anti-peptide antibody which can be detected, for 
example, by EL1SA assay using free peptide adsorbed to a solid surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by 
selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a 
solid support and elution of the selected antibodies according to methods well known 

15 in the art. 

Immunogenic epitope-bearing peptides of the invention, le. 9 those parts of a 
protein that elicit an antibody response when the whole protein is the immunogen, are 
identified according to methods known in the art. For instance, Geysen, et al 9 supra, 
discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of 

20 peptides of sufficient purity to react in an ELISA. Interaction of synthesized 
peptides with antibodies is then easily detected without removing them from the 
support. In this manner a peptide bearing an immunogenic epitope of a desired 
protein may be identified routinely by one of ordinary skill in the art. For instance, 
the immunologically important epitope in the coat protein of foot-and-mouth disease 

25 virus was located by Geysen et al supra with a resolution of seven amino acids by 
synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 
213 amino acid sequence of the protein. Then, a complete replacement set of peptides 
in which all 20 amino acids were substituted in turn at every position within the 
epitope were synthesized, and the particular amino acids conferring specificity for the 
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reaction with antibody were determined. Thus, peptide analogs of the epitope-bearing 
peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Geysen (1987) further describes this method of identifying a peptide 
bearing an immunogenic epitope of a desired protein. 

5 Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general 

method of detecting or determining the sequence of monomers (amino acids or other 
compounds) which is a topological equivalent of the epitope (i.e., a "mirnotope") 
which is complementary to a particular paratope (antigen binding site) of an antibody 
of interest. More generally, U.S. Patent No. 4,433,092, also to Geysen (1989), 

10 describes a method of detecting or determining a sequence of monomers which is a 
topographical equivalent of a ligand which is complementary to the ligand binding site 
of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 to Houghten, 
R. A. et al (1996) discloses linear C r C 7 -alkyl peralkylated oligopeptides and sets and 
libraries of such peptides, as well as methods for using such oligopeptide sets and 

15 libraries for determining the sequence of a peralkylated oligopeptide that preferentially 
binds to an acceptor molecule of interest. Thus, non-peptide analogs of the 
epitope-bearing peptides of the invention also can be made routinely by these 
methods. The entire disclosure of each document cited in this section on 
"Polypeptides and Fragments" is hereby incorporated herein by reference. 

20 As one of skill in the art will appreciate, the polypeptides of the present 

invention and the epitope-bearing fragments thereof described above can be combined 
with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric 
polypeptides. These fusion proteins facilitate purification and show an increased 
half-life in vivo. This has been shown, e.g., for chimeric proteins consisting of the 

25 first two domains of the human CD4-polypeptide and various domains of the 

constant regions of the heavy or light chains of mammalian immunoglobulins. (EPA 
0,394,827; Traunecker et al. (1 988) Nature 33 1 :84-86. Fusion proteins that have a 
disulfide-linked dimeric structure due to the IgG part can also be more efficient in 
binding and neutralizing other molecules than a monomelic E. faecalis polypeptide or 



WO 98/50554 



-41- 



PCT/US98/08959 



fragment thereof alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964. 
Nucleic acids encoding the above epitopes of E.faecalis polypeptides can also be 
recombined with a gene of interest as an epitope tag to aid in detection and 
purification of the expressed polypeptide. 

5 

Antibodies 

E.faecalis protein-specific antibodies for use in the present invention can be 
raised against the intact E. faecalis protein or an antigenic polypeptide fragment 
thereof, which may be presented together with a carrier protein, such as an albumin, to 

10 an animal system (such as rabbit or mouse) or, if it is long enough (at least about 25 
amino acids), without a carrier. 

As used herein, the tenn "antibody" (Ab) or "monoclonal antibody" (Mab) is 
meant to include intact molecules, single chain whole antibodies, and antibody 
fragments. Antibody fragments of the present invention include Fab and F(ab ! )2 and 

15 other fragments including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv). 
Also included in the present invention are chimeric and humanized monoclonal 
antibodies and polyclonal antibodies specific for the polypeptides of the present 
invention. The antibodies of the present invention may be prepared by any of a 
variety of methods. For example, cells expressing a polypeptide of the present 

20 invention or an antigenic fragment thereof can be administered to an animal in order to 
induce the production of sera containing polyclonal antibodies. For example, a 
preparation of E.faecalis polypeptide or fragment thereof is prepared and purified to 
render it substantially free of natural contaminants. Such a preparation is then 
introduced into an animal in order to produce polyclonal antisera of greater specific 

25 activity. 

In a preferred method, the antibodies of the present invention are monoclonal 
antibodies or binding fragments thereof. Such monoclonal antibodies can be prepared 
using hybridoma technology. See, e.g., Harlow et al., ANTIBODIES: A 
LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); 
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Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL 
HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and F(ab')2 fragments may be 
produced by proteolytic cleavage, using enzymes such as papain (to produce Fab 
fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, E.faecalis 
5 polypeptide-binding fragments, chimeric, and humanized antibodies can be produced 
through the application of recombinant DNA technology or through synthetic 
chemistry using methods known in the art. 

Alternatively, additional antibodies capable of binding to the polypeptide 
antigen of the present invention may be produced in a two-step procedure through the 

10 use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies 
are themselves antigens, and that, therefore, it is possible to obtain an antibody which 
binds to a second antibody. In accordance with this method, E. faecalis 
polypeptide-specific antibodies are used to immunize an animal, preferably a mouse. 
The splenocytes of such an animal are then used to produce hybridoma cells, and the 

15 hybridoma cells are screened to identify clones which produce an antibody whose 
ability to bind to the E. faecalis polypeptide-specific antibody can be blocked by the 
E.faecalis polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to 
the E.faecalis polypeptide-specific antibody and can be used to immunize an animal 
to induce formation of further E.faecalis polypeptide-specific antibodies. 

20 Antibodies and fragements thereof of the present invention may be described 

by the portion of a polypeptide of the present invention recognized or specifically 
bound by the antibody. Antibody binding fragements of a polypeptide of the present 
invention may be described or specified in the same manner as for polypeptide 
fragements discussed above., i.e, by N-terminal and C-terminal positions or by size in 

25 contiguous amino acid residues. Any number of antibody binding fragments, of a 
polypeptide of the present invention, specified by N-terminal and C-terminal 
positions or by size in amino acid residues, as described above, may also be excluded 
from the present invention. Therefore, the present invention includes antibodies the 
specifically bind a particuarlly discribed fragement of a polypeptide of the present 
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invention and allows for the exclusion of the same. 

Antibodies and fragements thereof of the present invention may also be 
described or specified in terms of their cross-reactivity. Antibodies and fragements 
that do not bind polypeptides of any other species of Enterococcus other than E. 
5 faecalis are included in the present invention. Likewise, antibodies and fragements 
that bind only species of Enterococcus, i.e. antibodies and fragements that do not bind 
bacteria from any genus other than Enterococcus, are included in the present 
invention. 

1 0 Diagnostic A ssays 

The present invention further relates to methods for assaying staphylococcal 
infection in an animal by detecting the expression of genes encoding staphylococcal 
polypeptides of the present invention. The methods comprise analyzing tissue or 
body fluid from the animal for Enterococcus-specific antibodies, nucleic acids, or 

15 proteins. Analysis of nucleic acid specific to Enterococcus is assayed by PCR or 
hybridization techniques using nucleic acid sequences of the present invention as 
either hybridization probes or primers. See, e.g., Sambrook et al. Molecular cloning: 
A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 
reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing 

20 differentiation among spotted fever group Rickettsiae species by analysis of restriction 
fragment length polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin. 
Microbiol. 32:589-595 (detecting B. burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Enterococcus has 
already been made, the present invention is useful for monitoring progression or 

25 regression of the disease state whereby patients exhibiting enhanced Enterococcus 

gene expression will experience a worse clinical outcome relative to patients expressing 
these gene(s) at a lower level. 

By "biological sample 11 is intended any biological sample obtained from an 
animal, cell line, tissue culture, or other source which contains Enterococcus 
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polypeptide, mRNA, or DNA. Biological samples include body fluids (such as saliva, 
blood, plasma, urine, mucus, synovial fluid, etc.) tissues (such as muscle, skin, and 
cartilage) and any other biological source suspected of containing Enterococcus 
polypeptides or nucleic acids. Methods for obtaining biological samples such as 
5 tissue are well known in the art. 

The present invention is useful for detecting diseases related to Enterococcus 
infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds, 
cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans. 
Total RNA can be isolated from a biological sample using any suitable 

10 technique such as the single-step guanidinium-thiocyanate-phenol-chloroform method 
described in Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding 
Enterococcus polypeptides having sufficient homology to the nucleic acid sequences 
identified in Table 1 to allow for hybridization between complementary sequences are 
then assayed using any appropriate method. These include Northern blot analysis, SI 

15 nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in 

combination with the polymerase chain reaction (RT-PCR), and reverse transcription 
in combination with the ligase chain reaction (RT-LCR). 

Northern blot analysis can be performed as described in Harada et al. (1990) 
Cell 63:303-3 12. Briefly, total RNA is prepared from a biological sample as described 

20 above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as 
glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel 
electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been 
linked to the filter by a UV linker, the filter is prehybridized in a solution containing 
formamide, SSC, Denhardt's solution, denatured salmon sperm, SDS, and sodium 

25 phosphate buffer. A E.faecalis polynucleotide sequence shown in Table 1 labeled 
according to any appropriate method (such as the 32 P-multiprimed DNA labeling 
system (Amersham)) is used as probe. After hybridization overnight, the filter is 
washed and exposed to x-ray film. DNA for use as probe according to the present 
invention is described in the sections above and will preferably at least 1 5 nucleotides 
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in length. 

SI mapping can be performed as described in Fujita et al. (1987) Cell 
49:357-367. To prepare probe DNA for use in SI mapping, the sense strand of an 
above-described E.faecalis DNA sequence of the present invention is used as a 
5 template to synthesize labeled antisense DNA. The antisense DNA can then be 
digested using an appropriate restriction endonuclease to generate further DNA 
probes of a desired length. Such antisense probes are useful for visualizing protected 
bands corresponding to the target mRNA (i.e., mRNA encoding Enterococcus 
polypeptides). 

10 Levels of mRNA encoding Enterococcus polypeptides are assayed, for e.g., 

using the RT-PCR method described in Makino et al. (1990) Technique 2:295-301. 
By this method, the radioactivities of the "amplicons" in the polyacrylamide gel bands 
are linearly related to the initial concentration of the target mRNA. Briefly, this 
method involves adding total RNA isolated from a biological sample in a reaction 

15 mixture containing a RT primer and appropriate buffer. After incubating for primer 
annealing, the mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase 
inhibitor and reverse transcriptase. After incubation to achieve reverse transcription 
of the RNA, the RT products are then subject to PCR using labeled primers. 
Alternatively, rather than labeling the primers, a labeled dNTP can be included in the 

20 PCR reaction mixture. PCR amplification can be performed in a DNA thermal cycler 
according to conventional techniques. After a suitable number of rounds to achieve 
amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel. 
After drying the gel, the radioactivity of the appropriate bands (corresponding to the 
mRNA encoding the Enterococcus polypeptides of the present invention) are 

25 quantified using an imaging analyzer. RT and PCR reaction ingredients and 

coniditions, reagent and gel concentrations, and labeling methods are well known in the 
art. Variations on the RT-PCR method will be apparent to the skilled artisan. Other 
PCR methods that can detect the nucleic acid of the present invention can be found in 
PCR PRIMER: A LABORATORY MANUAL (C.W. Dieffenbach et al. eds., Cold 
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Spring Harbor Lab Press, 1995). 

The polynucleotides of the present invention, including both DNA and RNA, 
may be used to detect polynucleotides of the present invention or Enterococcal 
species including E. faecalis using bio chip technology. The present invention 

5 includes both high density chip arrays (>1000 oligonucleotides per cm 2 ) and low 

density chip arrays (<1000 oligonucleotides per cm 2 ). Bio chips comprising arrays of 
polynucleotides of the present invention may be used to detect Enterococcal species, 
including E. faecalis, in biological and environmental samples and to diagnose an 
animal, including humans, with an E. faecalis or other Enterococcal infection. The bio 

10 chips of the present invention may comprise polynucleotide sequences of other 

pathogens including bacteria, viral, parasitic, and fungal polynucleotide sequences, in 
addition to the polynucleotide sequences of the present invention, for use in rapid 
diffenertial pathogenic detection and diagnosis. The bio chips can also be used to 
monitor an E. faecalis or other Enterococcal infections and to monitor the genetic 

15 changes (deletions, insertions, mismatches, etc.) in response to drug therapy in the 
clinic and drug development in the laboratory. The bio chip technology comprising 
arrays of polynucleotides of the present invention may also be used to simultaneously 
monitor the expression of a multiplicity of genes, including those of the present 
invention. The polynucleotides used to comprise a selected array may be specified in 

20 the same manner as for the fragements, i.e, by their 5' and 3' positions or length in 
contigious base pairs and include from. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcal species, including E, 
faecalis, using bio chip technology include those known in the art and those of: U.S. 
Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 5527681, 

25 5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365, 
WO/951 1995, WO/9743447, WO/9535505, each incorporated herein in their 
entireties. 

Biosensors using the polynucleotides of the present invention may also be 
used to detect, diagnose, and monitor E. faecalis or other Enterococcal species and 
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infections thereof. Biosensors using the polynucleotides of the present invention may 
also be used to detect particular polynucleotides of the present invention. Biosensors 
using the polynucleotides of the present invention may also be used to monitor the 
genetic changes (deletions, insertions, mismatches, etc.) in response to drug therapy in 

5 the clinic and drug development in the laboratory. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcal species, including E. 
faecalis, using biosenors include those known in the art and those of: U.S. Patent Nos 
5721 102, 5658732, 5631 170, and World Patent Nos. WO97/3501 1, WO/9720203, 
each incorporated herein in their entireties. 

10 Thus, the present invention includes both bio chips and biosensors comprising 

polynucleotides of the present invention and methods of their use. 

Assaying Enterococcus polypeptide levels in a biological sample can occur 
using any art-known method, such as antibody-based techniques. For example, 
Enterococcus polypeptide expression in tissues can be studied with classical 

15 immunohistological methods. In these, the specific recognition is provided by the 
primary antibody (polyclonal or monoclonal) but the secondary detection system can 
utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an 
immunohistological staining of tissue section for pathological examination is obtained. 
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of 

20 Enterococcus polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. 
et al. (1985) J. Cell. Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell . Biol. 
105:3087-3096. In this technique, which is based on the use of cationic solid phases, 
quantitation of a Enterococcus polypeptide can be accomplished using an isolated 
Enterococcus polypeptide as a standard. This technique can also be applied to body 

25 fluids. 

Other antibody-based methods useful for detecting Enterococcus polypeptide 
gene expression include immunoassays, such as the ELIS A and the radioimmunoassay 
(R1A). For example, a Enterococcus polypeptide-specific monoclonal antibodies can 
be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and 
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quantify a Enterococcus polypeptide. The amount of a Enterococcus polypeptide 
present in the sample can be calculated by reference to the amount present in a 
standard preparation using a linear regression computer algorithm. Such an ELISA is 
described in lacobelli et al. (1988) Breast Cancer Research and Treatment 1 1:19-30. In 

5 another ELISA assay, two distinct specific monoclonal antibodies can be used to 

detect Enterococcus polypeptides in a body fluid. In this assay, one of the antibodies 
is used as the immunoabsorbent and the other as the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or 
"two-step" assay. The "one-step" assay involves contacting the Enterococcus 

10 polypeptide with immobilized antibody and, without washing, contacting the mixture 
with the labeled antibody. The "two-step" assay involves washing before contacting 
the mixture with the labeled antibody. Other conventional methods may also be 
employed as suitable. It is usually desirable to immobilize one component of the 
assay system on a support, thereby allowing other components of the system to be 

15 brought into contact with the component and readily removed from the sample. 
Variations of the above and other immunological methods included in the present 
invention can also be found in Harlow et al, ANTIBODIES: A LABORATORY 
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). 

Suitable enzyme labels include, for example, those from the oxidase group, 

20 which catalyze the production of hydrogen peroxide by reacting with substrate. 
Glucose oxidase is particularly preferred as it has good stability and its substrate 
(glucose) is readily available. Activity of an oxidase label may be assayed by 
measuring the concentration of hydrogen peroxide formed by the enzyme-labeled 
antibody/substrate reaction. Besides enzymes, other suitable labels include 

25 radioisotopes, such as iodine ( 125 1, 121 I), carbon ( 14 C), sulphur ( 35 S), tritium ( 3 H), 

indium ( 112 In), and technetium ( 99m Tc), and fluorescent labels, such as fluorescein and 
rhodamine, and biotin. 

Further suitable labels for the Enterococcus polypeptide-specific antibodies of 
the present invention are provided below. Examples of suitable enzyme labels include 
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malate dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase, yeast-alcohol 
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, 
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, 
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and 
5 acetylcholine esterase. 

Examples of suitable radioisotopic labels include 3 H, U1 ln, I25 1, 13I I, 32 P, 35 S, 
,4 C, 5, Cr, 57 To, 58 Co, 59 Fe, 75 Se, 152 Eu, 90 Y, 67 Cu, 217 Ci, 2n At, 212 Pb, 47 Sc, ,09 Pd, etc. 
1 11 In is a preferred isotope where in vivo imaging is used since its avoids the problem 
of dehalogenation of the 125 I or 131 I-labeled monoclonal antibody by the liver. In 
1 0 addition, this radionucleotide has a more favorable gamma emission energy for imaging. 
See, e.g., Perkins et al. (1985) Eur. J. Nucl. Med. 10:296-301; Carasquillo et al. 
(1987) J. Nucl. Med. 28:281-287. For example, m ln coupled to monoclonal 
antibodies with l-(P-isothiocyanatobenzyl)-DPTA has shown little uptake in 
non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 
15 localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870. 

Examples of suitable non-radioactive isotopic labels include 157 Gd, 55 Mn, 
162 Dy, 52 Tr, and 56 Fe. 

Examples of suitable fluorescent labels include an ,52 Eu label, a fluorescein 
label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin 
20 label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label. 

Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, 
ricin, and cholera toxin. 

Examples of chemiluminescent labels include a luminal label, an isoluminal 
label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an 
25 oxalate ester label, a luciferin label, a luciferase label, and an aequorin label. 

Examples of nuclear magnetic resonance contrasting agents include heavy metal 
nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are 
provided by Kennedy et al. (1 976) Clin. Chim. Acta 70: 1-3 1 , and Schurs et al. (1 977) 
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Clin. Chim. Acta 8 1 : 1-40. Coupling techniques mentioned in the latter are the 
glutaraldehyde method, the periodate method, the dimaleimide method, the 
m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are 
incorporated by reference herein. 
5 In a related aspect, the invention includes a diagnostic kit for use in screening 

serum containing antibodies specific against E.faecalis infection. Such a kit may 
include an isolated E.faecalis antigen comprising an epitope which is specifically 
immunoreactive with at least one anti-Zs. faecalis antibody. Such a kit also includes 
means for detecting the binding of said antibody to the antigen. In specific 

10 embodiments, the kit may include a recombinantly produced or chemically 

synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen 
may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above-described kit 
includes a solid support to which said peptide or polypeptide antigen is attached. 

15 Such a kit may also include a non-attached reporter-labeled anti-human antibody. In 
this embodiment, binding of the antibody to the E.faecalis antigen can be detected by 
binding of the reporter labeled antibody to the anti-is. faecalis polypeptide antibody. 

In a related aspect, the invention includes a method of detecting E.faecalis 
infection in a subject. This detection method includes reacting a body fluid, preferably 

20 serum, from the subject with an isolated E.faecalis antigen, and examining the antigen 
for the presence of bound antibody. In a specific embodiment, the method includes a 
polypeptide antigen attached to a solid support, and serum is reacted with the 
support. Subsequently, the support is reacted with a reporter-labeled anti-human 
antibody. The support is then examined for the presence of reporter-labeled 

25 antibody. 

The solid surface reagent employed in the above assays and kits is prepared 
by known techniques for attaching protein material to solid support material, such as 
polymeric beads, dip sticks, 96-well plates or filter material. These attachment 
methods generally include non-specific adsorption of the protein to the support or 
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covalent attachment of the protein , typically through a free amine group, to a 
chemically reactive group on the solid support, such as an activated carboxyl, 
hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in 
conjunction with biotinylated antigen(s). 
5 The polypeptides and antibodies of the present invention, including fragments 

thereof, may be used to detect Enterococcal species including E.faecalis using bio chip 
and biosensor technology. Bio chip and biosensors of the present invention may 
comprise the polypeptides of the present invention to detect antibodies, which 
specifically recognize Enterococcal species, including E.faecalis. Bio chip and 

10 biosensors of the present invention may also comprise antibodies which specifically 
recognize the polypeptides of the present invention to detect Enterococcal species, 
including E.faecalis or specific polypeptides of the present invention. Bio chips or 
biosensors comprising polypeptides or antibodies of the present invention may be 
used to detect Enterococcal species, including E.faecalis, in biological and 

1 5 environmental samples and to diagnose an animal, including humans, with an E. 

faecalis or other Enterococcal infection. Thus, the present invention includes both bio 
chips and biosensors comprising polypeptides or antibodies of the present invention 
and methods of their use. 

The bio chips of the present invention may further comprise polypeptide 

20 sequences of other pathogens including bacteria, viral, parasitic, and fungal 

polypeptide sequences, in addition to the polypeptide sequences of the present 
invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio 
chips of the present invention may further comprise antibodies or fragements thereof 
specific for other pathogens including bacteria, viral, parasitic, and fungal polypeptide 

25 sequences, in addition to the antibodies or fragements thereof of the present invention, 
for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and 
biosensors of the present invention may also be used to monitor an E. faecalis or other 
Enterococcal infection and to monitor the genetic changes (amio acid deletions, 
insertions, substitutions, etc.) in response to drug therapy in the clinic and drug 
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development in the laboratory. The bio chip and biosensors comprising polypeptides 
or antibodies of the present invention may also be used to simultaneously monitor the 
expression of a multiplicity of polypeptides, including those of the present invention. 
The polypeptides used to comprise a bio chip or biosensor of the present invention 
5 may be specified in the same manner as for the fragements, i.e, by their N-terminal and 
C-terminal positions or length in contigious amino acid residue. Methods and 
particular uses of the polypeptides and antibodies of the present invention to detect 
Enterococcal species, including E.faecalis, or specific polypeptides using bio chip and 
biosensor technology include those known in the art, those of the U.S. Patent Nos. 
10 and World Patent Nos. listed above for bio chips and biosensors using 

polynucleotides of the present invention, and those of: U.S. Patent Nos. 5658732, 
5135852, 5567301, 5677196, 5690894 and World Patent Nos. W09729366, 
W096 12957, each incorporated herein in their entireties. 

15 Treatment: 

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compounds to identify 
those which enhance or block the biological activity of the E.faecalis polypeptides of 
the present invention. The present invention further provides where the compounds 

20 kill or slow the growth of E.faecalis. The ability of E. faecalis antagonists, including 
E.faecalis ligands, to prophylactically or therapeutically block antibiotic resistance 
may be easily tested by the skilled artisan. See, e.g., Straden et al. (1997) J Bacteriol. 
179(1):9-16. 

An agonist is a compound which increases the natural biological function or 
25 which functions in a manner similar to the polypeptides of the present invention, 
while antagonists decrease or eliminate such functions. Potential antagonists include 
small organic molecules, peptides, polypeptides, and antibodies that bind to a 
polypeptide of the invention and thereby inhibit or extinguish its activity. 

The antagonists may be employed for instance to inhibit peptidoglycan cross 
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bridge formation. Antibodies against E. faecalis may be employed to bind to and 
inhibit E. faecalis activity to treat antibiotic resistance. Any of the above antagonists 
may be employed in a composition with a pharmaceutically acceptable carrier. 

5 Vaccines 

The present invention also provides vaccines comprising one or more 

polypeptides of the present invention. Heterogeneity in the composition of a vaccine 

may be provided by combining E. faecalis polypeptides of the present invention. 

Multi-component vaccines of this type are desirable because they are likely to be 
10 more effective in eliciting protective immune responses against multiple species and 

strains of the Enterococcus genus than single polypeptide vaccines. 

Multi-component vaccines are known in the art to elicit antibody production 

to numerous immunogenic components. See, e.g., Decker et al. (1996) J. Infect. Dis. 

174:S270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent 
15 vaccine has recently been demonstrated to elicit protective levels of antibodies in 

human infants against all four pathogenic agents. See, e.g., Aristegui, J. et al. (1997) 

Vaccine 15:7-9. 

The present invention in addition to single-component vaccines includes 
multi-component vaccines. These vaccines comprise more than one polypeptide, 

20 immunogen or antigen. Thus, a multi-component vaccine would be a vaccine 

comprising more than one of the E. faecalis polypeptides of the present invention. 

Further within the scope of the invention are whole cell and whole viral 
vaccines. Such vaccines may be produced recombinantly and involve the expression 
of one or more of the E. faecalis polypeptides described in Table 1 . For example, the 

25 E. faecalis polypeptides of the present invention may be either secreted or localized 
intracellular, on ihe cell surface, or in the periplasmic space. Further, when a 
recombinant virus is used, the E. faecalis polypeptides of the present invention may, 
for example, be localized in the viral envelope, on the surface of the capsid, or 
internally within the capsid. Whole cells vaccines which employ cells expressing 
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heterologous proteins are known in the art. See, e.g., Robinson, K. et al. (1997) 
Nature Biotech. 15:653-657; Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033; 
Chabalgoity, J. et al. (1997) Infect. Immun. 65:2402-2412 . These cells may be 
administered live or may be killed prior to administration. Chabalgoity, J. et al., supra, 
5 for example, report the successful use in mice of a live attenuated Salmonella vaccine 
strain which expresses a portion of a platyhelminth fatty acid-binding protein as a 
fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques known in 
the art by combining one or more E. faecalis polypeptides of the present invention, or 

10 fragments thereof, with additional non-Enterococcal components (e.g., diphtheria 
toxin or tetanus toxin, and/or other compounds known to elicit an immune response). 
Such vaccines are useful for eliciting protective immune responses to both members of 
the Enterococcus genus and non-Enterococcal pathogenic agents. 

The vaccines of the present invention also include DNA vaccines. DNA 

15 vaccines are currently being developed for a number of infectious diseases. See, et al, 
Boyer, et al. (1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine 
14:1285-1288. Such DNA vaccines contain anucleotide sequence encoding one or 
more E. faecalis polypeptides of the present invention oriented in a manner that 
allows for expression of the subject polypeptide. For example, the direct 

20 administration of plasmid DNA encoding B. burgdorgeri OspA has been shown to 
elicit protective immunity in mice against borrelial challenge. See, Luke et al. (1997) J. 
Infect. Dis. 175:91-97. 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim et al. 

25 ( 1 997) Nature Biotech. 1 5:64 1 -646, for example, report the enhancement of immune 
responses produced by DNA immunizations when DNA sequences encoding 
molecules which stimulate the immune response are co-administered. In a similar 
fashion, the vaccines of the present invention may be co-administered with either 
nucleic acids encoding immune modulators or the immune modulators themselves. 
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These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

The vaccines of the present invention may be used to confer resistance to 
Enterococcal infection by either passive or active immunization. When the vaccines of 
5 the present invention are used to confer resistance to Enterococcal infection through 
active immunization, a vaccine of the present invention is administered to an animal to 
elicit a protective immune response which either prevents or attenuates a Enterococcal 
infection. When the vaccines of the present invention are used to confer resistance to 
Enterococcal infection through passive immunization, the vaccine is provided to a host 

10 animal (e.g., human, dog, or mouse), and the antisera elicited by this antisera is 

recovered and directly provided to a recipient suspected of having an infection caused 
by a member of the Enterococcus genus. 

The ability to label antibodies, or fragments of antibodies, with toxin molecules 
provides an additional method for treating Enterococcal infections when passive 

15 immunization is conducted. In this embodiment, antibodies, or fragments of 

antibodies, capable of recognizing the E. faecalis polypeptides disclosed herein, or 
fragments thereof, as well as other Enterococcus proteins, are labeled with toxin 
molecules prior to their administration to the patient. When such toxin derivatized 
antibodies bind to Enterococcus cells, toxin moieties will be localized to these cells and 

20 will cause their death. 

The present invention thus concerns and provides a means for preventing or 
attenuating a Enterococcal infection resulting from organisms which have antigens that 
are recognized and bound by antisera produced in response to the polypeptides of the 
present invention. As used herein, a vaccine is said to prevent or attenuate a disease if 

25 its administration to an animal results either in the total or partial attenuation (i.e., 
suppression) of a symptom or condition of the disease, or in the total or partial 
immunity of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for 
either a "prophylactic" or "therapeutic" purpose. When provided prophylactic.ally, 
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the coinpound(s) are provided in advance of any symptoms of Enterococcal infection. 
The prophylactic administration of the compound(s) serves to prevent or attenuate 
any subsequent infection. When provided therapeutically, the compound(s) is 
provided upon or after the detection of symptoms which indicate that an animal may 
5 be infected with a member of the Enterococcus genus. The therapeutic administration 
of the compound(s) serves to attenuate any actual infection. Thus, the E. faecalis 
polypeptides, and fragments thereof, of the present invention may be provided either 
prior to the onset of infection (so as to prevent or attenuate an anticipated infection) 
or after the initiation of an actual infection. 

10 The polypeptides of the invention, whether encoding a portion of a native 

protein or a functional derivative thereof, may be administered in pure form or may be 
coupled to a macromolecular carrier. Example of such carriers are proteins and 
carbohydrates. Suitable proteins which may act as macromolecular carrier for 
enhancing the immunogenicity of the polypeptides of the present invention include 

15 keyhole limpet hemacyanin (KLH) tetanus toxoid, pertussis toxin, bovine serum 
albumin, and ovalbumin. Methods for coupling the polypeptides of the present 
invention to such macromolecular carriers are disclosed in Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1988). 

20 A composition is said to be "pharmacologically or physiologically acceptable" 

if its administration can be tolerated by a recipient animal and is otherwise suitable for 
administration to that animal. Such an agent is said to be administered in a 
"therapeutically effective amount" if the amount administered is physiologically 
significant. An agent is physiologically significant if its presence results in a 

25 detectable change in the physiology of a recipient patient. 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that 
the composition of a pharmacologically acceptable compound varies with the animal 
to which it is administered. For example, a vaccine intended for human use will 
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generally not be co-administered with Freund's adjuvant. Further, the level of purity 
of the E. faecalis polypeptides of the present invention will normally be higher when 
administered to a human than when administered to a non-human animal. 

As would be understood by one of ordinary skill in the art, when the vaccine 

5 of the present invention is provided to an animal, it may be in a composition which 
may contain salts, buffers, adjuvants, or other substances which are desirable for 
improving the efficacy of the composition. Adjuvants are substances that can be used 
to specifically augment a specific immune response. These substances generally 
perform two functions: (1) they protect the antigen(s) from being rapidly catabolized 

10 after administration and (2) they nonspecifically stimulate immune responses. 

Normally, the adjuvant and the composition are mixed prior to presentation to 
the immune system, or presented separately, but into the same site of the animal being 
immunized. Adjuvants can be loosely divided into several groups based upon their 
composition. These groups include oil adjuvants (for example, Freund f s complete and 

15 incomplete), mineral salts (for example, A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , A1NH 4 (S0 4 ), silica, 
kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and 
certain natural substances (for example, wax D from Mycobacterium tuberculosis, as 
well as substances found in Corynebacterium parvum, or Bordetella pertussis, and 
members of the genus Brucella. Other substances useful as adjuvants are the saponins 

20 such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for use in 
the present invention include aluminum salts, such as A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , and 
A1NH 4 (S0 4 ). Examples of materials suitable for use in vaccine compositions are 
provided in REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A. 
Osol, ed, Mack Publishing Co, Easton, PA, (1980) (incorporated herein by reference). 

25 The therapeutic compositions of the present invention can be administered 

parenterally by injection, rapid infusion, nasopharyngeal absorption 
(intranasopharangeally), dermoabsorption, or orally. The compositions may 
alternatively be administered intramuscularly, or intravenously. Compositions for 
parenteral administration include sterile aqueous or non-aqueous solutions, 
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suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such 
as ethyl oleate. Carriers or occlusive dressings can be used to increase skin 
permeability and enhance antigen absorption. Liquid dosage forms for oral 
5 administration may generally comprise a liposome solution containing the liquid 
dosage form. Suitable forms for suspending liposomes include emulsions, suspen- 
sions, solutions, syrups, and elixirs containing inert diluents commonly used in the art, 
such as purified water. Besides the inert diluents, such compositions can also include 
adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, 

1 o flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
encapsulated form. For example, intranasal immunization using vaccines encapsulated 
in biodegradable microsphere composed of poly(DL-lactide-co-glycolide). See, 
Shahin, R. et al. (1995) Infect. Immun. 63:1 195-1200. Similarly, orally administered 

1 5 encapsulated Salmonella typhimurium antigens can also be used. Allaoui-Attarki, K. 
et al. (1997) Infect. Immun. 65:853-857. Encapsulated vaccines of the present 
invention can be administered by a variety of routes including those involving 
contacting the vaccine with mucous membranes {e.g., intranasally, intracolonicly, 
intraduodenally). 

20 Many different techniques exist for the timing of the immunizations when a 

multiple administration regimen is utilized. It is possible to use the compositions of 
the invention more than once to increase the levels and diversities of expression of the 
immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple 
immunizations are given, they will be given one to two months apart. 

25 According to the present invention, an "effective amount" of a therapeutic 

composition is one which is sufficient to achieve a desired biological effect. Generally, 
the dosage needed to provide an effective amount of the composition will vary 
depending upon such factors as the animal's or human's age, condition, sex, and extent 
of disease, if any, and other variables which can be adjusted by one of ordinary skill in 
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the art. 

The antigenic preparations of the invention can be administered by either 
single or multiple dosages of an effective amount. Effective amounts of the 
compositions of the invention can vary from 0.01-1,000 ^g/ml per dose, more 
5 preferably 0.1-500 [ig/m\ per dose, and most preferably 10-300 |ig/ml per dose. 

Examples 

Example 1: Isolation of a Selected DNA Clone From the Deposited Sample ofE. 
faecalis 

10 Three approaches can be used to isolate a E. faecalis clone comprising a 

polynucleotide of the present invention from any E. faecalis genomic DNA library. 
The E. faecalis strain V586 has been deposited as a convienent source for obtaining a 
E. faecalis strain although a wide varity of strains E. faecalis strains can be used which 
are known in the art. 

1 5 E. faecalis genomic DNA is prepared using the following method. A 20ml 

overnight bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain 
Heart Infusion broth or Super broth), pelleted, washed two times with TES (30mM 
Tris-pH 8.0, 25mM EDTA, 50mM NaCl), and resuspended in 5ml high salt TES 
(2.5M NaCl). Lysostaphin is added to final concentration of approx 50ug/ml and the 

20 mixture is rotated slowly 1 hour at 37C to make protoplast cells. The solution is then 
placed in incubator (or place in a shaking water bath) and warmed to 55C. Five 
hundred micro liter of 20% sarcosyl in TES (final concentration 2%) is then added to 
lyse the cells. Next, guanidine HC1 is added to a final concentration of 7M (3.69g in 
5.5 ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear). 

25 A CsCl gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCl and 
overlaying with 2.85M CsCl. The gradient is carefully overlayed with the DNA- 
containing GuHCl solution. The gradient is spun at 30,000 rpm, 20C for 24 hr and 
the lower DNA band is collected. The volume is increased to 5 ml with TE buffer. 
The DNA is then treated with protease K (10 ug/ml) overnight at 37 C, and 
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precipitated with ethanol. The precipitated DNA is resuspended in a desired buffer. 

In the first method, a plasmid is directly isolated by screening a plasmid E. 
faecalis genomic DNA library using a polynucleotide probe corresponding to a 
polynucleotide of the present invention. Particularly, a specific polynucleotide with 

5 30-40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer 
according to the sequence reported. The oligonucleotide is labeled, for instance, with 
32 P-y-ATP using T4 polynucleotide kinase and purified according to routine methods. 
{See, e.g., Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Press, Cold Spring, NY (1982).) The library is transformed into a suitable 

10 host, as indicated above (such as XL-1 Blue (Stratagene)) using techniques known to 
those of skill in the art. See, e.g., Sambrook et al. MOLECULAR CLONING: A 
LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al, 
CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, 
N.Y. 1989). The transformants are plated on 1 .5% agar plates (containing the 

15 appropriate selection agent, e.g., ampicillin) to a density of about 150 transformants 
(colonies) per plate. These plates are screened using Nylon membranes according to 
routine methods for bacterial colony screening. See, e.g., Sambrook et al. 
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, 
N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR 

20 BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of 
skill in the art. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends 
of a polynucleotide of Table 1 are synthesized and used to amplify the desired DNA 
by PCR using a E. faecalis genomic DNA prep as a template. PCR is carried out 
25 under routine conditions, for instance, in 25 |j,l of reaction mixture with 0.5 ug of the 
above DNA template. A convenient reaction mixture is 1.5-5 mM MgCl 2 , 0.01 % 
(w/v) gelatin, 20 M.M each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 
0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94 °C for 1 
min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are performed with a 
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Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed by 
agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by 
subcloning and sequencing the DNA product. 
5 Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically 

synthesized and used to generate a nucleotide sequence of desired length using PCR 
methods known in the art. 

Example 2(a): Expression and Purification Enterococcal polypeptides in E. coli 

10 The bacterial expression vector pQE60 was used for bacterial expression of 

some of the polypeptide fragements used in the soft tissue and systemic infection 
models discussed below. (Q1AGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains a 
bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome binding 

15 site ("RBS"), six codons encoding histidine residues that allow affinity purification 
using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) 
and suitable single restriction enzyme cleavage sites. These elements are arranged such 
that an inserted DNA fragment encoding a polypeptide expresses that polypeptide 
with the six His residues (i.e., a n 6 X His tag") covalently linked to the carboxyl 

20 terminus of that polypeptide. 

The DNA sequence encoding the desired portion of a E.faecalis protein of the 
present invention was amplified from E.faecalis genomic DNA using PCR 
oligonucleotide primers which anneal to the 5' and 3' sequences coding for the 
portions of the E.faecalis polynucleotide shown in Table 1. Additional nucleotides 

25 containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' 
and 3* sequences, respectively. 

For cloning the mature protein, the 5' primer has a sequence containing an 
appropriate restriction site followed by nucleotides of the amino terminal coding 
sequence of the desired E.faecalis polynucleotide sequence in Table 1 . One of 
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ordinary skill in the art would appreciate that the point in the protein coding sequence 
where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding 
any desired portion of the complete protein shorter or longer than the mature form. 
The 3 ! primer has a sequence containing an appropriate restriction site followed by 
5 nucleotides complementary to the 3 f end of the polypeptide coding sequence of Table 
1, excluding a stop codon, with the coding sequence aligned with the restriction site so 
as to maintain its reading frame with that of the six His codons in the pQE60 vector. 

The amplified E. faecalis DNA fragment and the vector pQE60 were digested 
with restriction enzymes which recognize the sites in the primers and the digested 

1 0 DNAs were then ligated together. The E. faecalis DNA was inserted into the 

restricted pQE60 vector in a manner which places the E. faecalis protein coding region 
downstream from the IPTG-inducible promoter and in-frame with an initiating AUG 
and the six histidine codons. 

The ligation mixture was transformed into competent E. coli cells using 

15 standard procedures such as those described by Sambrook et al., supra.. E. coli strain 
M15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac 
repressor and confers kanamycin resistance ("Kanr"), was used in carrying out the 
illustrative example described herein. This strain, which was only one of many that 
are suitable for expressing a E. faecalis polypeptide, is available commercially 

20 (Q1AGEN, Inc., supra). Transformants were identified by their ability to grow on LB 
agar plates in the presence of ampicillin and kanamycin. Plasmid DNA was isolated 
from resistant colonies and the identity of the cloned DNA confirmed by restriction 
analysis, PCR and DNA sequencing. 

Clones containing the desired constructs were grown overnight ("O/N") in 

25 liquid culture in LB media supplemented with both ampicillin (100 |ig/ml) and 
kanamycin (25 [ig/m\). The O/N culture was used to inoculate a large culture, at a 
dilution of approximately 1 :25 to 1 :250. The cells were grown to an optical density at 
600 nm ("00600") of between 0.4 and 0.6. Jsopropyl-p-D-thiogalactopyranoside 
("1PTG") was then added to a final concentration of 1 mM to induce transcription 
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from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequently were incubated further for 3 to 4 hours. Cells then were harvested by 
centrifugation. 

The cells were then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH 8. 
5 The cell debris was removed by centrifugation, and the supernatant containing the E. 
faecalis polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") 
affinity resin column (Q1AGEN, Inc., supra). Proteins with a 6 x His tag bind to the 
Ni-NTA resin with high affinity were purified in a simple one-step procedure (for 
details see: The QIAexpressionist, 1995, Q1AGEN, Inc., supra). Briefly the 
10 supernatant was loaded onto the column in 6 M guanidine-HCl, pH 8, the column was 
first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10 
volumes of 6 M guanidine-HCl pH 6, and finally the E. faecalis polypeptide was 
eluted with 6 M guanidine-HCl, pH 5. 

The purified protein was then renatured by dialyzing it against 
15 phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM 
NaCl. Alternatively, the protein could be successfully refolded while immobilized on 
the Ni-NTA column. The recommended conditions are as follows: renature using a 
linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
containing protease inhibitors. The renaturation should be performed over a period of 
20 1 .5 hours or more. After renaturation the proteins can be eluted by the addition of 
250 mM immidazole. Immidazole was removed by a final dialyzing step against PBS 
or 50 mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein was 
stored at 4° C or frozen at -80° C. 

Some of the polypeptide of the present invention were prepared using a non- 
25 denaturing protein purification method. For these polypeptides, the cell pellet from 
each liter of culture was resuspended in 25 mis of Lysis Buffer A at 4°C (Lysis Buffer 
A = 50 mM Na-phosphate, 300 mM NaCl, 10 mM 2-mercaptoethanol, 10% 
Glycerol, pH 7.5 with 1 tablet of Complete EDTA-free protease inhibitor cocktail 
(Boehringer Mannheim #1873580) per 50 ml of buffer). Absorbance at 550 nm was 
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approximately 10-20 O.D./ml. The suspension was then put through three 
freeze/thaw cycles from -70°C (using a ethanol-dry ice bath) up to room temperature. 
The cells were lysed via sonication in short 10 sec bursts over 3 minutes at 
approximately 80W while kept on ice. The sonicated sample was then centrifuged at 
5 15,000 RPM for 30 minutes at 4°C. The supernatant was passed through a column 
containing 1 .0 ml of CL-4B resin to pre-clear the sample of any proteins that may 
bind to agarose non-specifically, and the flow-through fraction was collected. 

The pre-cleared flow-through was applied to a nickel-nitrilo-tri-acetic acid 
("Ni-NTA") affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag 

1 0 bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step 
procedure. Briefly, the supernatant was loaded onto the column in Lysis Buffer A at 
4°C, the column was first washed with 10 volumes of Lysis Buffer A until the A280 
of the eluate returns to the baseline. Then, the column was washed with 5 volumes of 
40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) (Buffer B = 50 mM Na- 

15 Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 500 mM 
Imidazole, pH of the final buffer should be 7.5). The protein was eluted off of the 
column with a series of increasing Imidazole solutions made by adjusting the ratios of 
Lysis Buffer A to Buffer B. Three different concentrations were used: 3 volumes of 
75 mM Imidazole, 3 volumes of 150 mM Imidazole, 5 volumes of 500 mM 

20 Imidazole. The fractions containing the purified protein were analyzed using 8 %, 10 
% or 14% SDS-PAGE depending on the protein size. The purified protein was then 
dialyzed 2X against phosphate-buffered saline (PBS) in order to place it into an easily 
workable buffer. The purified protein was stored at 4° C or frozen at -80°. 

The following alternative method may be used to purify E.faecalis expressed 

25 in E coli when it is present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-1 0°C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per 
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unit weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 
5 The cells are then lysed by passing the solution through a microfluidizer 

(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 1 5 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

10 The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated 
at 4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 

1 5 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM 
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C 
without mixing for 12 hours prior to further purification steps. 

To clarify the refolded E.faecalis polypeptide solution, a previously prepared 

20 tangential filtration unit equipped with 0.16 ^tm membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros HS- 
50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same 

25 buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is 

continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the E.faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 
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Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins. 
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is 
then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 
5 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. 
Fractions are collected under constant A 2 go monitoring of the effluent. Fractions 
containing the E.faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) 
are then pooled. 

The resultant E.faecalis polypeptide exhibits greater than 95% purity after 
10 the above refolding and purification steps. No major contaminant bands are observed 
from Commassie blue stained 16% SDS-PAGE gel when 5 |Lig of purified protein is 
loaded. The purified protein is also tested for endotoxin/LPS contamination, and 
typically the LPS content is less than 0.1 ng/ml according to LAL assays. 

1 5 Example 2(b): Alternative Expression and Purification Enterococcal polypeptides in E. 
coli 

Tthe vector pQElO was alternatively used to clone and express some of the 
polypeptides of the present invention for use in the soft tissue and systemic infection 
models discussed below. The difference being such that an inserted DNA fragment 

20 encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a "6 
X His tag") covalently linked to the amino terminus of that polypeptide. The bacterial 
expression vector pQElO (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311) was used in this example . The components of the pQElO plasmid are 
arranged such that the inserted DNA sequence encoding a polypeptide of the present 

25 invention expresses the polypeptide with the six His residues {i.e., a "6 X His tag")) 
covalently linked to the amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of Table 
1 were amplified using PCR oligonucleotide primers from genomic E.faecalis DNA. 
The PCR primers anneal to the nucleotide sequences encoding the desired amino acid 
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sequence of a polypeptide of the present invention. Additional nucleotides containing 
restriction sites to facilitate cloning in the pQEl 0 vector were added to the 5' and 3 ! 
primer sequences, respectively. 

For cloning a polypeptide of the present invention, the 5* and 3' primers were 
5 selected to amplify their respective nucleotide coding sequences. One of ordinary skill 
in the art would appreciate that the point in the protein coding sequence where the 5 1 
and 3' primers begins may be varied to amplify a DNA segment encoding any desired 
portion of a polypeptide of the present invention. The 5 ! primer was designed so the 
coding sequence of the 6 X His tag is aligned with the restriction site so as to maintain 
10 its reading frame with that of E.faecalis polypeptide. The 3' was designed to include 
an stop codon. The amplified DNA fragment was then cloned, and the protein 
expressed, as described above for the pQE60 plasmid. 

The DNA sequences encoding the amino acid sequences of Table 1 may also 
be cloned and expressed as fusion proteins by a protocol similar to that described 
15 directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, 
Madison, WI 5371 1) is preferentially used in place of pQElO. 

The above methods are not limited to the polypeptide fragements actually 
produced. The above method, like the methods below, can be used to produce either 
full length polypeptides or desired fragements therof. 

20 

Example 2(c): Alternative Expression and Purification of Enterococcal polypeptides 
in E. coli 

The bacterial expression vector pQE60 is used for bacterial expression in this 
example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 9131 1). However, in 
25 this example, the polypeptide coding sequence is inserted such that translation of the 
six His codons is prevented and, therefore, the polypeptide is produced with no 6 X 
His tag. 

The DNA sequence encoding the desired portion of the E.faecalis amino acid 
sequence is amplified from an E.faecalis genomic DNA prep the deposited DNA 
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clones using PCR oligonucleotide primers which anneal to the 5' and 3' nucleotide 
sequences corresponding to the desired portion of the E. faecalis polypeptides. 
Additional nucleotides containing restriction sites to facilitate cloning in the pQE60 
vector are added to the 5' and 3* primer sequences. 
5 For cloning a E. faecalis polypeptides of the present invention, 5' and 3' 

primers are selected to amplify their respective nucleotide coding sequences. One of 
ordinary skill in the art would appreciate that the point in the protein coding sequence 
where the 5 1 and 3' primers begin may be varied to amplify a DNA segment encoding 
any desired portion of a polypeptide of the present invention. The 3' and 5' primers 

10 contain appropriate restriction sites followed by nucleotides complementary to the 5' 
and 3' ends of the coding sequence respectively. The 3' primer is additionally designed 
to include an in-frame stop codon. 

The amplified E. faecalis DNA fragments and the vector pQE60 are digested 
with restriction enzymes recognizing the sites in the primers and the digested DNAs 

15 are then ligated together. Insertion of the E. faecalis DNA into the restricted pQE60 
vector places the E. faecalis protein coding region including its associated stop codon 
downstream from the IPTG-inducible promoter and in-frame with an initiating AUG. 
The associated stop codon prevents translation of the six histidine codons 
downstream of the insertion point. 

20 The ligation mixture is transformed into competent E. coli cells using standard 

procedures such as those described by Sambrook et al. E. coli strain M15/rep4, 
containing multiple copies of the plasmid pREP4, which expresses the lac repressor 
and confers kanamycin resistance ("Kanr"), is used in carrying out the illustrative 
example described herein. This strain, which is only one of many that are suitable for 

25 expressing E. faecalis polypeptide, is available commercially (Q1AGEN, Inc., supra). 
Transformants are identified by their ability to grow on LB plates in the presence of 
ampicillin and kanamycin. Plasmid DNA is isolated from resistant colonies and the 
identity of the cloned DNA confirmed by restriction analysis, PCR and DNA 
sequencing. 
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Clones containing the desired constructs are grown overnight ("O/N") in liquid 
culture in LB media supplemented with both ampicillin (100 \ig/m\) and kanamycin 
(25 |lg/ml). The O/N culture is used to inoculate a large culture, at a dilution of 
approximately 1 :25 to 1 :250. The cells are grown to an optical density at 600 nm 
5 ("OD600") of between 0.4 and 0.6. isopropyl-b-D-thiogalactopyranoside ("IPTG") is 
then added to a final concentration of 1 mM to induce transcription from the lac 
repressor sensitive promoter, by inactivating the lacl repressor. Cells subsequently 
are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the E. faecalis polypeptide, the cells are then stirred for 3-4 hours at 
10 4°C in 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the 
supernatant containing the E. faecalis polypeptide is dialyzed against 50 mM Na- 
acetate buffer pH 6, supplemented with 200 mM NaCl. Alternatively, the protein 
can be successfully refolded by dialyzing it against 500 mM NaCl, 20% glycerol, 25 
mM Tris/HCl pH 7.4, containing protease inhibitors. After renaturation the protein 
1 5 can be purified by ion exchange, hydrophobic interaction and size exclusion 

chromatography. Alternatively, an affinity chromatography step such as an antibody 
column can be used to obtain pure E. faecalis polypeptide. The purified protein is 
stored at 4° C or frozen at -80° C. 

The following alternative method may be used to purify E. faecalis 
20 polypeptides expressed in E coli when it is present in the form of inclusion bodies. 
Unless otherwise specified, all of the following steps are conducted at 4-1 0°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-1 0°C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per 
25 unit weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells ware then lysed by passing the solution through a microfluidizer 
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(Microfuidics, Corp. or APV Gaulin, Inc.) twice al 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 1 5 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 
5 The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 1 5 min., the 
pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated 
at 4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 

10 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM 
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C 
without mixing for 12 hours prior to further purification steps. 

To clarify the refolded E.faecalis polypeptide solution, a previously prepared 

1 5 tangential filtration unit equipped with 0. 1 6 |i.m membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros HS- 
50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same 

20 buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is 

continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the E.faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 

25 Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins. 
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is 
then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 
mM sodium acetate, pH 6.0 to 1 .0 M NaCl, 50 mM sodium acetate, pH 6.5. 
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Fractions are collected under constant A 2 so monitoring of the effluent. Fractions 
containing the E.faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) 
are then pooled. 

The resultant E.faecalis polypeptide exhibits greater than 95% purity after 
5 the above refolding and purification steps. No major contaminant bands are observed 
from Commassie blue stained 16% SDS-PAGE gel when 5 jig of purified protein is 
loaded. The purified protein is also tested for endotoxin/LPS contamination, and 
typically the LPS content is less than 0.1 ng/ml according to LAL assays. 

1 0 Example 2(d): Cloning and Expression of E. faecalis in Other Bacteria 

E.faecalis polypeptides can also be produced in: E.faecalis using the methods 
of S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. I. Moreno (1996) Protein 
Expr. Purif. 8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 
Appl. Microbiol. Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods 

15 Chang et al., U.S. Patent No. 4,952,508. 

Example 3: Cloning and Expression in COS Cells 

A E. faecalis expression plasmid is made by cloning a portion of the DNA 
encoding a E.faecalis polypeptide into the expression vector pDNAI/Amp or 

20 pDNAIII (which can be obtained from Invitrogen, Inc.). The expression vector 

pDNAl/amp contains: (1) an E. coli origin of replication effective for propagation in 
E. coli and other prokaryotic cells; (2) an ampicillin resistance gene for selection of 
plasmid-containing prokaryotic cells; (3) an SV40 origin of replication for propagation 
in eukaryotic cells; (4) a CMV promoter, a polylinker, an SV40 intron; (5) several 

25 codons encoding a hemagglutinin fragment (i.e., an "HA" tag to facilitate purification) 
followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and 
operably linked to the SV40 intron and the polyadenylation signal by means of 
restriction sites in the polylinker. The HA tag corresponds to an epitope derived 
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from the influenza hemagglutinin protein described by Wilson et al. 1984 Cell 37:767. 
The fusion of the HA tag to the target protein allows easy detection and recovery of 
the recombinant protein with an antibody that recognizes the HA epitope. pDNAIII 
contains, in addition, the selectable neomycin marker. 
5 A DNA fragment encoding a E. faecalis polypeptide is cloned into the 

polylinker region of the vector so that recombinant protein expression is directed by 
the CMV promoter. The plasmid construction strategy is as follows. The DNA from 
a E. faecalis genomic DNA prep is amplified using primers that contain convenient 
restriction sites, much as described above for construction of vectors for expression of 

10 E. faecalis in E. coli. The 5' primer contains a Kozak sequence, an AUG start codon, 
and nucleotides of the 5' coding region of the E. faecalis polypeptide. The 3 f primer, 
contains nucleotides complementary to the 3' coding sequence of the E. faecalis DNA, 
a stop codon, and a convenient restriction site. 

The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested 

15 with appropriate restriction enzymes and then ligated. The ligation mixture is 
transformed into an appropriate E. coli strain such as SURE™ (Stratagene Cloning 
Systems, La Jolla, CA 92037), and the transformed culture is plated on ampicillin 
media plates which then are incubated to allow growth of ampicillin resistant colonies. 
Plasmid DNA is isolated from resistant colonies and examined by restriction analysis 

20 or other means for the presence of the fragment encoding the E. faecalis polypeptide 
For expression of a recombinant E. faecalis polypeptide, COS cells are 
transfected with an expression vector, as described above, using DEAE-dextran, as 
described, for instance, by Sambrook et al. (supra). Cells are incubated under 
conditions for expression of E. faecalis by the vector. 

25 Expression of the E.faecalis-HA fusion protein is detected by radiolabeling 

and immunoprecipitation, using methods described in, for example Harlow et al., 
supra.. To this end, two days after transfection, the cells are labeled by incubation in 
media containing 35 S-cysteine for 8 hours. The cells and the media are collected, and 
the cells are washed and the lysed with detergent-containing R]PA buffer: 150 mM 
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NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRJS, pH 7.5, as 
described by Wilson et al. (supra ). Proteins are precipitated from the cell lysate and 
from the culture media using an HA-specific monoclonal antibody. The precipitated 
proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
5 product of the expected size is seen in the cell lysate, which is not seen in negative 
controls. 

Example 4: Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of E.faecalis polypeptide in this 

1 o example. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession 
No. 37146). The plasmid contains the mouse DHFR gene under control of the SV40 
early promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate 
activity that are transfected with these plasmids can be selected by growing the cells 
in a selective medium (alpha minus MEM, Life Technologies) supplemented with the 

15 chemotherapeutic agent methotrexate. The amplification of the DHFR genes in cells 
resistant to methotrexate (MTX) has been well documented. See, e.g., Alt et al., 
1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. Acta, 
1097:107-143; Page etal., 1991, Biotechnology 9:64-68. Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the target 

20 enzyme, DHFR, as a result of amplification of the DHFR gene. If a second gene is 
linked to the DHFR gene, it is usually co-amplified and over-expressed. It is known 
in the art that this approach may be used to develop cell lines carrying more than 
1,000 copies of the amplified gene(s). Subsequently, when the methotrexate is 
withdrawn, cell lines are obtained which contain the amplified gene integrated into one 

25 or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) 
of the Rouse Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. 
(1985) Mol. Cell. Biol. 5:438-447; plus a fragment isolated from the enhancer of the 
immediate early gene of human cytomegalovirus (CMV), Boshart, et al., 1985, Cell 
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41 :521 -530. Downstream of the promoter are the following single restriction enzyme 
cleavage sites that allow the integration of the genes: Bam HI, Xba I, and Asp 718. 
Behind these cloning sites the plasmid contains the 3' intron and polyadenylation site 
of the rat preproinsulin gene. Other high efficiency promoters can also be used for the 

5 expression, e.g., the human fl-actin promoter, the SV40 early or late promoters or the 
long terminal repeats from other retroviruses, e.g., HIV and HTLVI. Clontech's Tet- 
Off and Tet-On gene expression systems and similar systems can be used to express 
the E.faecalis polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, 
Proc. Natl Acad. Sci. USA 89:5547-555 1 . For the polyadenylation of the mRNA 

1 0 other signals, e.g., from the human growth hormone or globin genes can be used as 
well. Stable cell lines carrying a gene of interest integrated into the chromosomes can 
also be selected upon co-transfection with a selectable marker such as gpt, G418 or 
hygromycin. It is advantageous to use more than one selectable marker in the 
beginning, e.g., G4 18 plus methotrexate. 

15 The plasmid pC4 is digested with the restriction enzymes and then 

dephosphorylated using calf intestinal phosphates by procedures known in the art. 
The vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E, 
faecalis polypeptide is amplified using PCR oligonucleotide primers corresponding to 
the 5' and 3* sequences of the desired portion of the gene. A 5' primer containing a 

20 restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the 5 ! 
coding region of the E.faecalis polypeptide is synthesized and used. A 3' primer, 
containing a restriction site, stop codon, and nucleotides complementary to the 3' 
coding sequence of the E.faecalis polypeptides is synthesized and used. The 
amplified fragment is digested with the restriction endonucleases and then purified 

25 again on a 1 % agarose gel. The isolated fragment and the dephosphorylated vector are 
then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
pC4 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
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transfection. Five |ig of the expression plasmid pC4 is cotransfected with 0.5 |ig of 
the plasmid pSVneo using a lipid-mediated transfection agent such as Lipofectin™ or 
LipofectAMINE.™ (LifeTechnologies Gaithersburg, MD). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 

5 that confers resistance to a group of antibiotics including G41 8. The cells are seeded 
in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 

10 petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 

100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 |lM, 2 pM, 5 pM, 10 mM, 20 mM), The same 
procedure is repeated until clones are obtained which grow at a concentration of 

1 5 1 00-200 |iM. Expression of the desired gene product is analyzed, for instance, by 
SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

Example 5: Quantitative Murine Soft Tissue Infection Model for E. faecalis 

Compositions of the present invention, including polypeptides and peptides, 

20 are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E. faecalis) using the following quantitative 
murine soft tissue infection model. Mice (e.g., NIH Swiss female mice, approximately 
7 weeks old) are first treated with a biologically protective effective amount, or 
immune enhancing/stimulating effective amount of a composition of the present 

25 invention using methods known in the art, such as those discussed above. See, e.g., 
Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug 
per animal. 



WO 98/50554 



-76- 



PCI7US98/08959 



The desired bacterial species used to challenge the mice, such as E.faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10 8 
cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1 :2 with sterilized Cytodex 3 microcarrier beads preswollen 

5 in sterile PBS (3g/100ml). Mice are anesthetize briefly until docile, but still mobile 
and injected with 0.2 ml of the Cytodex 3 beac^acterial mixture into each animal 
subcutaneously in the inguinal region. After four days, counting the day of injection 
as day one, mice are sacrificed and the contents of the abscess is excised and placed in 
a 15 ml conical tube containing 1.0ml of sterile PBS. The contents of the abscess is 

10 then enzymaticaily treated and plated as follows. 

The abscess is first disrupted by vortexing with sterilized glass beads placed in 
the tubes. 3.0mls of prepared enzyme mixture (1 .0ml Collagenase D (4.0 mg/ml), 
1 .0ml Trypsin (6.0 mg/ml) and 8.0 mis PBS) is then added to each tube followed by a 
20 min. incubation at 37C. The solution is then centrifuged and the supernatant 

1 5 drawn off. 0.5 ml dH20 is then added and the tubes are vortexed and then incubated 
for 10 min. at room temperature. 0.5 ml media is then added and samples are serially 
diluted and plated onto agar plates, and grown overnight at 37C Plates with distinct 
and separate colonies are then counted, compared to positive and negative control 
samples, and quantified. The method can be used to identify composition and 

20 determine appropriate and effective doses for humans and other animals by comparing 
the effective doses of compositions of the present invention with compositions 
known in the art to be effective in both mice and humans. Doses for the effective 
treatment of humans and other animals, using compositions of the present invention, 
are extrapolated using the data from the above experiments of mice. It is appreciated 

25 that further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 

Example 6: Murine Systemic Neutropenic Model for E.faecalis Infection 
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Compositions of the present invention, including polypeptides and peptides, 
are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E.faecalis) using the following qualitative murine 
systemic neutropenic model. Mice (e.g., NIH Swiss female mice, approximately 7 

5 weeks old) are first treated with a biologically protective effective amount, or immune 
enhancing/stimulating effective amount of a composition of the present invention 
using methods known in the art, such as those discussed above. See,e.g. t Harlow et 
al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug per animal. 

10 Mice are then injected with 250 - 300 mg/kg cyclophosphamide intraperitonially. 

Counting the day of CP. injection as day one, the mice are left untreated for 5 days to 
begin recovery of PMNL'S. 

The desired bacterial species used to challenge the mice, such as E.faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10 8 

15 cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1 :2 in 4% Brewer's yeast in media. 
Mice are injected with the bacteria/brewer's yeast challenge intraperitonially. The 
Brewer's yeast solution alone is used as a control. The mice are then monitered twice 
daily for the first week following challenge, and once a day for the next week to 

20 ascertain morbidity and mortality. Mice remaining at the end of the experiment are 
sacrificed. The method can be used to identify compositions and determine 
appropriate and effective doses for humans and other animals by comparing the 
effective doses of compositions of the present invention with compositions known in 
the art to be effective in both mice and humans. Doses for the effective treatment of 

25 humans and other animals, using compositions of the present invention, are 

extrapolated using the data from the above experiments of mice. It is appreciated that 
further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 
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The disclosure of all publications (including patents, patent applications, 
journal articles, laboratory manuals, books, or other documents) cited herein are 
hereby incorporated by reference in their entireties. 

The present invention is not to be limited in scope by the specific 
5 embodiments described herein, which are intended as single illustrations of individual 
aspects of the invention. Functionally equivalent methods and components are within 
the scope of the invention, in addition to those shown and described herein and will 
become apparant to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of 
10 the appended claims. 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

EF001-1 (SEQ ID NO:l) 

TGAAAGAATA TTGCCAGAAC GTGGCGAGCA AATTGTTTTA TAAATTTTTT TAAGGGAGAG 
AAAAAAATGA AGTTCAAAAC TCTAGCAACA ACAGTGTTAG CAACCGCAGC TATTTTCGCA 
TTGGGGGCTT GTGGTAACGG TAATGGGGCC AAAGAATCAA ACGATATTGT GAAAGAAGTG 
AAGGAAGATA CGACAATCAC TTTCTGGCAT GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
ACAAAATTAA CGAAAGACTT CATGAAAGAA AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TTACCAACAA TTACGCAAGC GTACCCAGGC TGGTTATGGA ATGCTGCACA AGATGAAATG 
TTAGTGGACT TAAAAC C ATA TATGGATGAT GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA ATCGACGGCA AACAATACGG CATTCCATTT 
AATAAATCGA CAGAAATGTT ATTCTATAAT GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA GCTTCTAAAA CAATTTACGA AAAATCCAAC 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG TTAAATAACT ATTACGCAAT TGGAATGAAA 
AACAAAGGCG TTGATTTTAA TAAAGACTTA GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GTGGACTATT ACCGTGATGG TATCGAAGCA GGTTACTTCC GCACAGCTGG TTCAGATAAA 
TATTTATCTG GCCCATTTGC AAACAAAAAG GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT GGTGGCTATG AATACGGTGT TGCACCACGT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA GATATTTATA TGTTCGATAG TGCTACGCCA 
GAACAACGGA CAGCGGCATT TGAATTCATG AAATTCTTAG CTACTCCTGA TTCACAATTG 
TACTGGGCAC AACAAACAGG TTATATGCCA ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
TACAAAAATT CTAAGACAAC CAAAGTACCT GCACAACTTG AAAACGCAGT AAAAGATTTA 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT TCAGCCTATA ATGAAATGCG GACAATTATG 
GAAAGTATTT TTGCTTCATC AAATAAAGAC ACGAGAAAAT TATTGAAAGA TGCAACATCA 
CAATTTGAAC AAGCATGGAA CCAATAA 



EF001-2 (SEQ ID NO:2) 

MKFKTLATT VLATAAIFAL GACGNGNGAK ESNDIVKEVK 
EDTTITFWHA MNGVQEEALT KLTKDFMKEN PKIKVELQNQ 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD TIGWKDAEPI 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA SKTIYEKSNK 
KGVDFNKDLD LTSKDSQEW DYYRDGIEAG YFRTAGSDKY 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD IYMFDSATPE 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA QLENAVKDLF 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



EF001-3 (SEQ ID NO:3) 

TT GTGGTAACGG TAATGGGGCC AAAGAATCAA ACGATATTGT GAAAGAAGTG 
AAGGAAGATA CGACAATCAC TTTCTGGCAT GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
ACAAAATTAA CGAAAGACTT CATGAAAGAA AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TTACCAACAA TTACGCAAGC GTACCCAGGC TGGTTATGGA ATGCTGCACA AGATGAAATG 
TTAGTGGACT TAAAAC C ATA TATGGATGAT GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA ATCGACGGCA AACAATACGG CATTCCATTT 
AATAAATCGA CAGAAATGTT ATTCTATAAT GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA GCTTCTAAAA CAATTTACGA AAAATCCAAC 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG TTAAATAACT ATTACGCAAT TGGAATGAAA 
AACAAAGGCG TTGATTTTAA TAAAGACTTA GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GTGGACTATT ACCGTGATGG TATCGAAGCA GGTTACTTCC GCACAGCTGG TTCAGATAAA 
TATTTATCTG GCCCATTTGC AAACAAAAAG GTAGCAATGT TTGTCGGTAG TATTGCTGGT 



SAYPDLQAKI 
REVLLDGAKI 
EWGAGFDSL 
LSGPFANKKV 
QRTAAFEFMK 
AIPVEENADS 



NSTLTSPKDL 
DGKQYGIPFN 
NNYYAIGMKN 
AMFVGSIAGA 
FLATPDSQLY 
AYNEMRTIME 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT GGTGGCTATG AATACGGTGT TGCACCACGT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA GATATTTATA TGTTCGATAG TGCTACGCCA 
GAACAACGGA CAGCGGCATT TGAATTCATG AAATTCTTAG CTACTCCTGA TTCACAATTG 
TACTGGGCAC AACAAACAGG TTATATGCCA ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
TACAAAAATT CTAAGACAAC CAAAGTACCT GCACAACTTG AAAACGCAGT AAAAGATTTA 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT TCAGCCTATA ATGAAATGCG GACAATTATG 
GAAAGTATTT TTGCTTCATC AAATAAAGAC ACGAGAAAAT TATTGAAAGA TGCAACATCA 
CAATTTGAAC AAGCATGGAA CCAA 



EF001-4 (SEQ ID NO:4) 

CGNGNGAK ESNDIVKEVK 
EDTTITFWHA MNGVQEEALT KLTKDFMKEN 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA 
KGVDFNKDLD LTSKDSQEW DYYRDGIEAG 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL 
TIGWKDAEPI REVLLDGAKI DGKQYGIPFN 
SKTIYEKSNK EWGAGFDSL NNYYAIGMKN 
YFRTAGSDKY LSGPFANKKV AMFVGS I AGA 
IYMFDSATPE QRTAAFEFMK FLATPDSQLY 
QLENAVKDLF AIPVEENADS AYNEMRTIME 



EF002-1 (SEQ ID NO: 5) 

TAAATAGCGG AGGTAGTACA AATGAAATTT TGGAAAAAAG GCTTAACAGC GGCAGCGCTG 
TTAGCAGTGG CGGCAGTAAC TTTAACAGCA TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 
GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA CTTGGAATTA TGACACGACC 
CCAGAATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA ATCCTGATAT CACTATTGAA 
CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA CAACGATGCT TTCATCAGGA 
GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG ATATCGAACC TGCCAAAGCA 
AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC AGCCTTACCG TACAGATTTC 
TGGGTATTGT ATTACAATAA AAAAATGTTT GATGAAGCCG GAATTGCCTA TCCCGATAAC 
TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT CTAAACCAGA AGAACAAGTA 
TATGGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA CTTATTATGA TCGCGCATTG 
AGAATGCAAA AAGATCAATC ACAAATGGAT TTTGGAACAG CAAAATCAAC AAAAGTAACG 
TATCAATCAC AATTTGAAAA TTCAAAAGCG GCGATGATGT ACATGGGTAG CTGGTACATG 
GGG AC TTTAT TAACAAACAT TGATGATGGC AAAACAAATG TCGAATGGGG GATTGCCGAA 
ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT CACCGACAAG TTTTGCAATT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT CTTATAAAAC AGATGAAATT 
GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG AGTCTCACAA AAAGCCTTTA 
AC CC AG AT AC AATTAATTTA G 



EF002-2 {SEQ ID NO: 6) 

MKFW KKGLTAAALL AVAAVTLTAC GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 
EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 
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KIYFARKGMP SDESHKKPLT QIQLI 

EF002-3 (SEQ ID NO:7) 

A TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 

GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA CTTGGAATTA TGACACGACC 
CCAGAATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA ATCCTGATAT CACTATTGAA 
CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA CAACGATGCT TTCATCAGGA 
GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG ATATCGAACC TGCCAAAGCA 
AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC AGCCTTACCG TACAGATTTC 
TGGGTATTGT ATTACAATAA AAAAATGTTT GATGAAGCCG GAATTGCCTA TCCCGATAAC 
TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT CTAAACCAGA AGAACAAGTA 
TATGGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA CTTATTATGA TCGCGCATTG 
AGAATGCAAA AAGATCAATC ACAAATGGAT TTTGGAACAG CAAAATCAAC AAAAGTAACG 
TATCAATCAC AATTTGAAAA TTCAAAAGCG GCGATGATGT ACATGGGTAG CTGGTACATG 
GGG AC TTTAT TAACAAACAT TGATGATGGC AAAACAAATG TCGAATGGGG GATTGCCGAA 
ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT CACCGACAAG TTTTGCAATT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT CTTATAAAAC AGATGAAATT 
GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG AGTCTCACAA AAAGCCTTTA 
ACCCAGATAC AATTAATT 

EF002-4 (SEQ ID NO: 8) 

C GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 

EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 
KIYFARKGMP SDESHKKPLT QIQLI 



EF003-1 (SEQ ID NO:9) 

TAGGAGGACA AAAGAATGAA GAAGTTTTAT 
ATTTTAGCTG CCTGTGGGGG AAATAAACAA 
GTTGCCGTGC AATTGGAATC TTCAAAAGAT 
AAAAAAGGGT ACAAAATTAA CATTATGGAA 
GTGCAACATG ACGAAGCGGA TGCTAATTTT 
AACAAAGAGA AAAAAGCTGA TTTAGTGGCT 
TTCTATTCAA AAGAATACCA AGATGCGAAA 
CCTAGCGATC CAACCAATGA AGGTCGTGCT 
AAATTAAAAG AAGGTGTCGG CTTTAACGGC 
AACATCACTT TTGAAAGCAT TGATTTACTG 
ATCGCTATGG TGTTCTGCTA CCCAGCCTAC 
GCGATCTTGT TAGAAGATAA AGAAGCAAGT 
AAAGGCGAAA AAGATAGCGA AAAAATCAAG 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA 



TTAGCNACAT TCGCTGTTAT TGCAACAGTT 
GCAGACCAGA AAGAAGACAA GGAGATTACC 
ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 
GTGAGCGACA ATGTTGCCTA CAACGATGCC 
GCGCAACATC AACCCTTCAT GGAAATGTTT 
GTGCAACCGA TTTATTATTT TGCTGGTGGT 
GATTTACCTG AAAATGCCAA AGTGGGGATT 
TTAGCAATTT TAAATGCAAA CGGCGTGATT 
ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 
AATTTAGCTA AAGCCTATGA TGAAAAAGAC 
TTAGAACCTG CTGGTTTAAC AACGAAAGAT 
AAACATTACG CATTGCAAGT TGTGACACGC 
GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GGCGCCAATA TTCCTGCGTT TTAA 



EF003-2 (SEQ ID NO: 10) 
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MKKFYL ATFAVIATVI LAACGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEK 
KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADWENPKN 
ITFESIDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQWTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF 

EF003-3 (SEQ ID NO:ll) 

CTGTGGGGG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC 
GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 
AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA ATGTTGCCTA CAACGATGCC 
GTGCAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC AACCCTTCAT GGAAATGTTT 
AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA TTTATTATTT TGCTGGTGGT 
TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG AAAATGCCAA AGTGGGGATT 
CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT TAAATGCAAA CGGCGTGATT 
AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 
AACATCACTT TTGAAAGCAT TGATTTACTG AATTTAGCTA AAGCCTATGA TGAAAAAGAC 
ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG CTGGTTTAAC AACGAAAGAT 
GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG CATTGCAAGT TGTGACACGC 
AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA TTCCTGCGTT T 



EF003-4 (SEQ ID NO:12) 

CGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEK 

KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADWENPKN 
ITFESIDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQWTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF 



EF004-1 (SEQ ID NO:13) 

TAAATCGAAA GAAGGATGAT AGAAATGAAA AAAATGATTA AATTTGCAGG CATTGCTCTT 
ATTTTTGCAG CTCTTCTCTC TGCCTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 
GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC TGACGAGAAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AATAA 

EF004-2 (SEQ ID NO: 14) 

MKK MIKFAGIALI FAALLSACSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 
ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 

EF004-3 (SEQ ID NO:15) 
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CTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 

GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC TGACGAGAAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AA 



EF004-4 (SEQ ID NO: 16) 

CSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 

ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 



EF005-1 (SEQ ID NO : 17 ) 

TAAAAAATGA AAAAACGATT GACGATTGTG GGGATGCTTT TTCTGGCCAT TTTAGTAATG 
GTTGGTTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
GAATCCTTTG CCAATAGTGT AGCTAAACTG GATCAACAGC GCGAGGAAAG CAAGAATAAC 
TGGCCTGCAG AAGACTATGC TACAATTACT AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GTTGCCACAG AATCAGCCAA TTCTGGGAAT GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
AGTATTTCAG CGTTGTTAGC AACTTTATTT GATGATTTTA AAGTCCCAGA AGGCGGTTTG 
AAGAATGCTA GTGTCACAAC AATTCATTAC AAAAATGGCG AATATAC TTT GGATAAAGTC 
AATGATGTCA GCTACTTAGA AGCAGGCGAA AAAGAATCAA AATAA 

EF005-2 (SEQ ID NO: 18) 

MKKRLTIVG MLFLAILVMV GCGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 
WSDAVLTPEG EKWTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEWRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLWSHGLS I SALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 

EF005-3 (SEQ ID NO: 19) 

TTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
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GAATCCTTTG CCAATAGTGT AGCTAAACTG 
TGGCCTGCAG AAGACTATGC TACAATTACT 
GTTGCCACAG AATCAGCCAA TTCTGGGAAT 
AGTATTTCAG CGTTGTTAGC AACTTTATTT 
AAGAATGCTA GTGTCACAAC AATTCATTAC 
AATGATGTCA GCTACTTAGA AGCAGGCGAA 



GATCAACAGC GCGAGGAAAG CAAGAATAAC 
AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
GATGATTTTA AAGTCCCAGA AGGCGGTTTG 
AAAAATGGCG AATATACTTT GGATAAAGTC 
AAAGAATCAA AA 



EF005-4 (SEQ ID NO:20) 



CGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 

WSDAVLTPEG EKWTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEWRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLWSHGLS I SALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 



EF006-1 (SEQ ID NO:21) 



TAAACGATAA ATGGAGGGAA TAAGATGAAA 
GCAGTAGCTG TCTTAGTTTT AGGGGCTTGC 
AAAGTTGGAG CTTCACCAGT TCCACATGCA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC 
GCGGTTAAAG AAAATGATTA TGACTTTGTG 
GGGCTTTACT CGAAAAAATA CAAATCGTTA 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC 
ATCACGCTGA AAGAAGGGGT AGACCGGACA 
AC T AAAAAGT TGAAATTCAA TCATGAAAGT 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC 
CCGAAAAAAG ATGCGATTGC CTTAGAAAAA 
GTTCGTAAAG AAGACGAAAA CAACGAAAAT 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA 
TAA 



AAACGTACAT TATGGTCAGT AATTACTGTA 
GGCAATAAAA AGAGTGATGA CTCGGTCTTG 
GAGATTTTAG AACATGTAAA ACCTTTATTA 
ACTTATACAG ATTACGTGCT ACCTAACAAG 
TATTTCCAAC ATGTGCCGTT CTTTAATGAA 
AATGCAGGTG CGATTCATTT AGAACCAGTT 
CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ACTGCTACTT TCGATGATAT TGATAAAAAT 
GATCCAGCAA TCATGACCAC TCTTTATGAC 
TCAAACTTTG CCGTGGATCA AGGATTAAAT 
GAAAGTTCAC CTTATGCCAA TATTATTGCG 
GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
TGGAACGGCG CTATTGTTCC AGTCAATGAA 



EF006-2 (SEQ ID NO:22) 



MKK RTLWSVITVA VAVLVLGACG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 
KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 



EF006-3 (SEQ ID NO:23) 



TTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 

AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT CTTTAATGAA 
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT 
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CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG 
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 



EF006-4 (SEQ ID NO:24) 

CG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 

KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 

EF008-1 (SEQ ID NO:25) 

TAAACCGTGA GAAAGAAATG GAGGAATCAA CGAATGAAAA AATTTAGTTT ATTTTTTTTA 
ACACTTTTAG CAGGGTTAAC GTTAGCTGCT TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 
AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTGAAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG 
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN 
ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA 
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG TCTCAAAAGA AGTGAAACGA 
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCGATACG 
TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAATAA 



EF008-2 (SEQ ID NO:26) 

MKKFSLFFLT LLAGLTLAAC GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD 
KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN 
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN 
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT 
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL AKEGTEGDTY 
YSMMNWNLTK IHDGLMSK 

EF008-3 (SEQ ID NO:27) 

T TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 

AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTGAAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG 
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
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TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN 
ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA 
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG TCTCAAAAGA AGTGAAACGA 
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCGATACG 
TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAA 

EF008-4 (SEQ ID NO:28) 

C GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD 

KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN 
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN 
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT 
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL AKEGTEGDTY 
YSMMNWNLTK IHDGLMSK 



EF009-1 (SEQ ID NO:29) 

TGACAAATGA AAAAATTTAG TAAATTAATT GGACTTATTG GGGTATTAGC TTTTACGATT 
GCAGGTTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA - 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA 
GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAATAA 

EF009-2 (SEQ ID NO:30) 

MKKFSKLIG LIGVLAFTIA GCASGSVKDT KTETVKLGW GTKNDEWESV KDRLKKKNID 
LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV NAPLGIYANK 
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT 
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IWAREEDQE 
NKLYQKWEE YQQEETKKVI AETSKGANVP AWETFGKK 

EF009-3 (SEQ ID NO:31) 

TTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA 
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GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAA 



EF009-4 (SEQ ID NO:32) 

CASGSVKDT KTETVKLGW GTKNDEWESV KDRLKKKNID 

LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVS IGNTV NAPLGIYANK 
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT 
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IWAREEDQE 
NKLYQKWEE YQQEETKKVI AETSKGANVP AWETFGKK 



EF010-1 (SEQ ID NO:33) 

TGAAAGAATA AAATTGTACA GGAGGAAATA AGGAATGAAA AAATGGCAAA AAGGATTAGC 
CGTAGCTGGC GCACAGCTTT AGCTGTAGGA CTAAGCGCGT GCGGTAAATC TTCAAAAGAT 
GCAGCGTCAA AAGGTGATGA TAGTACACCA ACGTTATTAA TGTATCGTGT TGGGGACAAA 
CCAGATAATT ATGACCAATT AATCGATAAT GCGAATAAAA TTATCGAGAA AAAAATTGGG 
GCAAAATTAA AAATGGAATT TGTTGGTTGG GGCGATTGGG ACCAAAAAAT GTCAACAATC 
GTTGCTTCTG GTGAAAGCTA TGATATTTCA TTAGCACAAA ATTATGCAAC GAATGCACAA 
AAAGGCGCCT ATGCTGATTT AACTGATTTA GCACCTAAAT ATGCCAAAGA AGCCTATGAT 
CAATTGCCAG ATAAC TATAT TAAAGGAAAT ACGATTAATG GAAAACTGTA TGCGTTCCCA 
ATTTTAGGTA ACTCTTACGG TCAACAAGTT TTAACTTTTA ATAAAGAATA TGTCGATAAA 
TACAATTTAG ATATTAGTAA AGTCGATGGT AGTTATGAAA GTGCAACGGA AGTTCTAAAA 
GAATTCCNTA AAAANGANCC AAATATTGCT GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA 
ACAGGTAATT ATGACTTCCC TATTGGTAAC CAATATCCAT TTGCAGTAAA AACAACTGAT 
ACTGGCTCAC CAAAAATTAT TAACCAATAT GCCGACAAAG ACATGATTAA TAACTTAAAA 
GTCTTGCATC AATGGTATAA AGATGGCTTG ATTCCAACAG ATGCTGCTAC AAGTACAACA 
CCATATGACT TAAATACCAA TACTTGGTTT ATGCGTCAAG AAACACAAGG ACCTATGGAT 
TATGGTGATA CAATCTTAAC ACAAGCTGCT GGCAAACCAC TTGTTTCTCG TCCACTAACA 
GAACCATTAA AAACAACAGC TCAAGCGCAA ATGGCTAACT ATGTTGTTGC AAACACGTCT 
AAAAACAAAG AAAAATCTGT TGAATTGTTA GGTTTATTAA ACAGCAATCC AGAATTGTTA 
AACGGACTTG TTTATGGTGA AGAAGGCAAA CAATATGAAA AAGTTGGCGA TGATCGTGTG 
AAATTGTTGA AAGATTACAC ACCAACAACT CATTTGAGTG CTTGGAACAC AGGAAACAAC 
TTAATCATTT GGCCAGAAGA ATCTGTCACT GAAGAAATGG TTAAAGAACG TGATAAGAGC 
ATCGAAGAAG CAAAAGATTC ACCAATTCTT GGTTTTACTT TTGTAAATGA TAAAGTGAAA 
ACTGAAATCA CTAACGTTGC TACAGTTATG AACCGTTACG CAGCAAGCTT AAATACAGGA 
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA TTAATGGATG ACCTAAAAAC AGCTGGCTGG 
GATAAAGTTC AAAAAGAAAT GCAAACACAA TTAGACGAAT ATATCCAATC TCAAAAATAA 

EF010-2 (SEQ ID NO:34) 

MAKRISR SWRTALAVGL SACGKSSKDA ASKGDDSTPT LLMYRVGDKP 
DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG DWDQKMSTIV ASGESYDISL AQNYATNAQK 
GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 
NLDISKVDGS YESATEVLKE FXKXXPNIAA FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 
GSPKIINQYA DKDMINNLKV LHQWYKDGLI PTDAATSTTP YDLNTNTWFM RQETQGPMDY 
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM ANYWANTSK NKEKSVELLG LLNSNPELLN 
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH LSAWNTGNNL IIWPEESVTE EMVKERDKS I 
EEAKDSPILG FTFVNDKVKT EITNVATVMN RYAASLNTGT VDPEETLPKL MDDLKTAGWD 
KVQKEMQTQL DEYIQSQK 



EF010-3 (SEQ ID NO:35) 
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GT GCGGTAAATC TTCAAAAGAT 
GCAGCGTCAA AAGGTGATGA TAGTACACCA 
CCAGATAATT ATGACCAATT AATCGATAAT 
GCAAAATTAA AAATGGAATT TGTTGGTTGG 
GTTGCTTCTG GTGAAAGCTA TGATATTTCA 
AAAGGCGCCT ATGCTGATTT AACTGATTTA 
CAATTGCCAG ATAACTATAT TAAAGGAAAT 
ATTTTAGGTA AC TCTTAC GG TCAACAAGTT 
TACAATTTAG ATATTAGTAA AGTCGATGGT 
GAATTCCNTA AAAANGANCC AAATATTGCT 
ACAGGTAATT ATGACTTCCC TATTGGTAAC 
ACTGGCTCAC CAAAAATTAT TAACCAATAT 
GTCTTGCATC AATGGTATAA AGATGGCTTG 
CCATATGACT TAAATACCAA TACTTGGTTT 
TATGGTGATA CAATCTTAAC ACAAGCTGCT 
GAACCATTAA AAACAACAGC TCAAGCGCAA 
AAAAACAAAG AAAAATCTGT TGAATTGTTA 
AACGGACTTG TTTATGGTGA AGAAGGCAAA 
AAATTGTTGA AAGATTACAC ACCAACAACT 
TTAATCATTT GGCCAGAAGA ATCTGTCACT 
ATCGAAGAAG CAAAAGATTC ACCAATTCTT 
ACTGAAATCA CTAACGTTGC TACAGTTATG 
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA 
GATAAAGTTC AAAAAGAAAT GCAAACACAA 



ACGTTATTAA TGTATCGTGT TGGGGACAAA 
GCGAATAAAA TTATCGAGAA AAAAATTGGG 
GGCGATTGGG ACCAAAAAAT GTCAACAATC 
TTAGCACAAA ATTATGCAAC GAATGCACAA 
GCACCTAAAT ATGCCAAAGA AGCCTATGAT 
ACGATTAATG GAAAAC TGTA TGCGTTCCCA 
TTAACTTTTA ATAAAGAATA TGTCGATAAA 
AGTTATGAAA GTGCAACGGA AGTTCTAAAA 
GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA 
CAATATCCAT TTGCAGTAAA AACAACTGAT 
GCCGACAAAG ACATGATTAA TAACTTAAAA 
ATTCCAACAG ATGCTGCTAC AAGTACAACA 
ATGCGTCAAG AAACACAAGG ACCTATGGAT 
GGCAAACCAC TTGTTTCTCG TCCACTAACA 
ATGGCTAACT ATGTTGTTGC AAACACGTCT 
GGTTTATTAA ACAGCAATCC AGAATTGTTA 
CAATATGAAA AAGTTGGCGA TGATCGTGTG 
CATTTGAGTG CTTGGAACAC AGGAAACAAC 
GAAGAAATGG TTAAAGAACG TGATAAGAGC 
GGTTTTACTT TTGTAAATGA TAAAGTGAAA 
AACCGTTACG CAGCAAGCTT AAATACAGGA 
TTAATGGATG ACCTAAAAAC AGCTGGCTGG 
TTAGACGAAT ATATCCAATC TCAAAAA 



EF010-4 (SEQ ID NO:36) 

CGKSSKDA ASKGDDSTPT LLMYRVGDKP 
DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG 
GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT 
NLDISKVDGS YESATEVLKE FXKXXPNIAA 
GSPKIINQYA DKDMINNLKV LHQWYKDGLI 
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM 
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH 
EEAKDSPILG FTFVNDKVKT EITNVATVMN 
KVQKEMQTQL DEYIQSQK 



DWDQKMSTIV ASGESYDISL AQNYATNAQK 
INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 
FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 
PTDAATSTTP YDLNTNTWFM RQETQGPMDY 
ANYWANTSK NKEKSVELLG LLNSNPELLN 
LSAWNTGNNL IIWPEESVTE EMVKERDKSI 
RYAASLNTGT VDPEETLPKL MDDLKTAGWD 



EF011-1 (SEQ ID NO:37) 

TAACGTTTTT GGAGGAAAAG AATGAAAAAG 
ATGGGACTGT TAATGTTAAG TGCTTGTCAA 
ACAGAAACAA CAGCTAAAAC GGAAGTCACA 
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT 
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG 
AAATACCAAA AAGTTGAATC AGCAGGCGGC 
CAACTAAAAC CAGACTTAAT TATTATTTCT 
AAAGCCATTG CGCCAACCAT TTACTTAGCT 
AAACAAAATA TCGAAACGTT AGGCACTATT 
ATAACTGGCT TAGAAAAAGA AATTGCTGAC 
AATGCGCTTG TTGTGTTAGT TAACGAAGGA 
TTCGGTTTAA TTCATGATAC ATTTGGCTTC 



AAATTTTTAG CAATGATGGC AGTTTCAATG 
ACAAATAAAA AAACAGCAGA TTCTGCAACA 
GTCAAAGACA CCAATGGTCA ATTAACCGTT 
GATAATGGTT CCTTGGATAC AATGGATGCA 
CCAACTAAAA ATATCCCTGC GTATTTGAAA 
ATTAAAGAAC CAGATTTAGA AAAAATCAAT 
GGTCGTCAAC AAGATTATCA AGAACAATTA 
GTAGATGCCA AAAATCCTTG GGCATCAACG 
TTTGATAAAG AAGAGGTAGC TAAAGAAAAA 
GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT 
CAACTTTCCG CTTACGGAAA AGGCTCTCGT 
AAAGCAGCAG ACGATAAGAT TGAAGCTTCC 



WO 98/50554 



PCT/US98/089S9 



89 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 

ACTCATGGGC AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT 
GTGGTAGATC GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC 
GAATTGATTC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT 
GTTTGGTATC TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA 
AAAGGATTAG AGTAA 

EF011-2 (SEQ ID NO:38) 

MKKK FLAMMAVSMM GLLMLSACQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP 

KNPKKVWFD NGSLDTMDAL GVGDRWGAP TKNIPAYLKK YQKVESAGGI KEPDLEKINQ 

LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI 

TGLEKEIADV KKQAEASANN ALWLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST 

HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV 
WYLSGGGLES MHLMIEDVKK GLE 



EF011-3 (SEQ ID NO:39) 



TTGTCAA ACAAATAAAA AAACAGCAGA TTCTGCAACA 

ACAGAAACAA CAGCTAAAAC GGAAGTCACA GTCAAAGACA CCAATGGTCA ATTAACCGTT 
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT GATAATGGTT CCTTGGATAC AATGGATGCA 
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG CCAACTAAAA ATATCCCTGC GTATTTGAAA 
AAATACCAAA AAGTTGAATC AGCAGGCGGC ATTAAAGAAC CAGATTTAGA AAAAATCAAT 
CAACTAAAAC CAGACTTAAT TATTATTTCT GGTCGTCAAC AAGATTATCA AGAACAATTA 
AAAGCCATTG CGCCAACCAT TTACTTAGCT GTAGATGCCA AAAATCCTTG GGCATCAACG 
AAACAAAATA TCGAAACGTT AGGCACTATT TTTGATAAAG AAGAGGTAGC TAAAGAAAAA 
ATAACTGGCT TAGAAAAAGA AATTGCTGAC GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT 
AATGCGCTTG TTGTGTTAGT TAACGAAGGA CAACTTTCCG CTTACGGAAA AGGCTCTCGT 
TTCGGTTTAA TTCATGATAC ATTTGGCTTC AAAGCAGCAG ACGATAAGAT TGAAGCTTCC 
ACTCATGGGC AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT 
GTGGTAGATC GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC 
GAATTGATTC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT 
GTTTGGTATC TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA 
AAAGGATTAG AG 



EF011-4 (SEQ ID NO:40) 

CQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP 
KNPKKVWFD NGSLDTMDAL GVGDRWGAP TKNIPAYLKK 
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK 
TGLEKEIADV KKQAEASANN ALWLVNEGQ LSAYGKGSRF 
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE 
WYLSGGGLES MHLMIEDVKK GLE 

EF012-1 (SEQ ID NO:41) 

TGAGGGGGCA ACAACATGAA ATTGGGGAAA AAAGTAGTAG GTTTGATTGC AACAGGGTTT 
CTTTTAGCCG CATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA 
GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA 
CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG 
TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT 



YQKVESAGGI KEPDLEKINQ 
QNIETLGTIF DKEEVAKEKI 
GLIHDTFGFK AADDKIEAST 
LIQKTDAGKN DKVIMLQPDV 
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GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GG AG CAACGG CGAGCCAATC 
ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG 
AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA 
GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG 
GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA 
TTTGTCGAAG CGC AAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG 
CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA 
GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA 
GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA 
CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC 
TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA 
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC 
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAATGA 

EF012-2 (SEQ ID NO:42) 

MKLGKK WGLIATGFL LAACGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ 
TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT 
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA 
QPSFLAWSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE 
YYDADQVKLE EVA VST IKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY 
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL 
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF012-3 (SEQ ID NO: 43) 

ATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA 
GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA 
CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG 
TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT 
GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC 
ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG 
AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA 
GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG 
GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA 
TTTGTCGAAG CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG 
CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA 
GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA 
GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA 
CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC 
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TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA 
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC 
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAA 



EF012-4 (SEQ ID NO:44) 

CGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ 

TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT 
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA 
QPSFLAWSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE 
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY 
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL 
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF013-1 (SEQ ID NO:45) 

TAACGAAAAA TGAAAAAAAT TGCTTTGTTC AGTATGTTAA CGTTCAGTGT ATTGTCTTTA 
AGTCTAGCAG GATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG 
TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG 
CCTTCTGCAG GAACTGATGA AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC 
ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA 
CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT 
GATTTACCAA ATTAA 

EF013-2 (SEQ ID NO:46) 

MKKIALFS MLTFSVLSLS LAGCGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS 
VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWFNPE RNEINGTTLP 
NATITATWG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG 
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG 
TKNRLDVTLN GEIGTPYLFD LPN 
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ATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG 
TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG 
CCTTCTGCAG GAACTGATGA AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC 
ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA 
CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT 
GATTTACCAA AT 

EF013-4 (SEQ ID NO:48) 

CGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS 
VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI 
NATITATWG DASAQAGVFY ADANGNFTVI SPRAGATTQL 
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI 
TKNRLDVTLN GEIGTPYLFD LPN 

EF014-1 (SEQ ID NO:49) 

TGATGGTGGA GACTTTTTAA GAGAGAGGAA GTACAGCCAA TGAGTAGGAA GCGAAAAATC 
AGCTTAATTA GTTTAGTCAT CATTTTGGTT TTTGTCACAG TCGGCTCAGC ATACTTTGCT 
GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 
ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC 
AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 
AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCC TATA CCTACGATGG ACCAACAGCT 
TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 
TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 
CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT 
ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 
AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA 
AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC 
GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA 
ACGGAAAATG GCTTTATAAA ATAA 

EF014-2 (SEQ ID NO:50) 

MSRKRKIS LISLVIILVF VTVGSAYFAV AGSYLKKTID KGYVPIKNDY 
NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYWFNF LSFIKLIDAV GGIDVNVKQA 
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL 



TITNWFNPE RNEINGTTLP 
IATVDQRNSA PVQIDIPSSG 
LASFTADAQG NFTASNLVPG 
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YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF014-3 (SEQ ID NO:51) 

TGCT 

GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 
ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC 
AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 
AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT 
TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 
TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 
CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT 
ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 
AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA 
AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC 
GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA 
ACGGAAAATG GCTTTATAAA A 



EF014-4 (SEQ ID NO:52) 
AV AGSYLKKTID KGYVPIKNDY 

NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYWFNF 
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK 
YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF015-1 (SEQ ID NO: 53) 

TAATTAAAAA TGTGTAAAAA GGGTCTGATG AAAAAAGGAG ACATAATAGT TATTATCTTT 
TTAATAGCTA TCTCTTTTTC TCCATATTTT ATTTTTTTTC ACAATAATCC ATTTAACTCC 
AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT 
ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATCCATCAAA AGGGCAATAT 
AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 
ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 
AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA CTAA 

EF015-2 (SEQ ID NO: 54) 

MK KGDIIVIIFL IAISFSPYFI FFHNNPFNSK SFDDTKYAW KIDGKEIERI 
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR 
FILEIVQQYS KDYYIY 

EF015-3 (SEQ ID NO:55) 



TKKITYLSLP 
LSFIKLIDAV 
RGFRQQEIIQ 
QQLSFDWRTF 
SDYTVQDEAA 



RDSFVQIDAK 
GGIDVNVKQA 
AVEDKLKSGQ 
SNEGRSMVEL 
EENEMTSING 
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CAATAATCC ATTTAACTCC 

AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT 
ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATCCATCAAA AGGGCAATAT 
AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 
ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 
AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA C 

EF015-4 (SEQ ID NO:56) 

NNPFNSK SFDDTKYAW KIDGKEIERI 

NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR 
FILEIVQQYS KDYYIY 



EF016-1 (SEQ ID NO:57) 

TGACGGTTGC CCCCGTCCAA TAGAAAGGAG 
TTGCTGGTTA TCTGTTGTAG TTTACTCCTA 
GAAGATCAAT GGACACGGAT TAACGAAGAA 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT 
ATGAAAGAAA CAGAATTACA AAATCAAACC 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT 
AAATTTGTTA AAGACCAAAC ACCTATTTTA 
TTAAAATCTG GTC GAATTG A CGGACTCCTA 
TCCCACGAAG ATAATTTAAA AAACTATACT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT 
GTTACAAATA ACACAAAAAT AAACTAA 



TTTATGATGA AAAAGAAATA TTCTTTAGCC 
TTTGCAGGTT GTGGTAAAAG AAAAAGCAAC 
AAACGGATTA TTATTGGCTT AGATGACTCC 
GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACACAACCTT ACATGACGAA CGACCAAGTA 
ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
TATGACGGCT TTAATGAAGC TTTCTTAGAT 
ATCGATCGCG TTTACGCCAA CTACTATCTT 
ATTTCTCATG TAGGCTATGA CAATGAAGAT 
CAATTAGTCC AAAAAATCAA TACTGCCTTT 
AAAATTTCTC AAAAATGGTT TGGAGAGGAC 



EF016-2 (SEQ ID NO:58) 

MMKKKYSLAL LVICCSLLLF AGCGKRKSNE 
VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK 



DQWTRINEEK RIIIGLDDSF 
VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
ISQKWFGEDV TNNTKIN 



EF016-3 (SEQ ID NO: 59) 
AAGCAAC 

GAAGATCAAT GGACACGGAT TAACGAAGAA 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT 
ATGAAAGAAA CAGAATTACA AAATCAAACC 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT 
AAATTTGTTA AAGACCAAAC ACCTATTTTA 
TTAAAATCTG GTCGAATTGA CGGACTCCTA 



AAACGGATTA TTATTGGCTT AGATGACTCC 
GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACACAACCTT ACATGACGAA CGACCAAGTA 
ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
TATGACGGCT TTAATGAAGC TTTCTTAGAT 
ATCGATCGCG TTTACGCCAA CTACTATCTT 
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TCCCACGAAG ATAATTTAAA AAACTATACT ATTTCTCATG TAGGCTATGA CAATGAAGAT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT CAATTAGTCC AAAAAATCAA TACTGCCTTT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT AAAATTTCTC AAAAATGGTT TGGAGAGGAC 
GTTACAAATA ACACAAAAAT AAAC 

EF016-4 (SEQ ID NO: 60) 

SNE DQWTRINEEK RIIIGLDDSF 

VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK ISQKWFGEDV TNNTKIN 



EF017-1 (SEQ ID NO:61) 

TGAGGTGTTT TTATGAAAAG GGCAACAAAG CAAAGGCTGT CTTTGGCAGC AATCATGGTT 
CTACTTCTCT CGGGCTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 
TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 
ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 
ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT 
TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 
TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 
GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 
CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 
CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 
TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 
TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 
AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 
GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCC ATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GCGACAAGTA A 



EF017-2 (SEQ ID NO: 62) 

MKRATKQ RLSLAAIMVL LLSGCGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 
AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QWKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL 
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 
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LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS 
PVPFNNRLEK SRSGDFDIW GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA 
NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 

EF017-3 (SEQ ID NO: 63) 

CTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 

TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 
ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 
ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT 
TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 
TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 
GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 
CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 
CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 
TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 
TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 
AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 
GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GCGACAAG 



EF017-4 (SEQ ID NO: 64) 

CGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 

AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QWKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL 
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 
LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS 
PVPFNNRLEK SRSGDFDIW GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA 
NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 



EF018-1 (SEQ ID NO: 65) 

TGTCATTACA ACGATACCAA TTTTAATCAT 
CGGTATGATG GCCGGTGCAG TAAAAGAATA 



TTATCCATTA CTACAAAAAC ACTTTATCGG 
AAGAAAGTAG GGAACAATAT GAAAAAAGTT 
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TTAGGCGGTT TATTGGTGGC AACGGCGGTC GTTAGTTTAG CGGCCTGTAG CGGTGGGGAA 
AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT 
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT 
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AGACTTAAAT 
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA 
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT 
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT 
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT x 
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT 
GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC 
ACCACAGTGG GCGATGGTGC AGGACCATTT TTAGACAAAT TACAAGACTT CTTAGGTGTT 
CCTTTAGAGG ATAAAAATGG TAAATACTAT GATCGAAATT TAGATAAAGA ATATTTAGAA 
TGGTTAAAAA CATTTAATGA TGTTTACCGA GCAGGCAATA TTAGTGATGA TAGCTTCACA 
GATGATGGGG CAACGTTTGA TGAAAAAGTG AAACAAGGAA ATTATGCAAC CATGCTCGTT 
GCTGGAACCA GTGGTCAAGG TGGGAACTTC ACAGAATTTA TGAAAAAATC TGGCACACGT 
TATATAGCCA TTGATGGACC AAGTAGCACT TCTGGCCGAA AACCAACATT AAATCAAACC 
GGCATTTCAG GTTGGTTAAG TAATTACATT ACGAAAGATG CGAAAGATCC AGCAAAAGTC 
ACTCAACTGT TCACATATTT AATTGATGAA CCGGGACAAA TTTTAACAAA ATATGGCGTT 
GAAGGAGTTA CTTATGCGTA CAATGATCAA GGAAAAATTG ATTATTTACC AGAAGTGAAA 
AAATTAGAAC AAACAGACAA TGATGCCTAC AACAAAAAAT ATGGCATTAG TCGTTTCCTA 
TACTTTAACA ACGACCGTGT CAATAAACTA AAAGTACCAA TGGAAAGTGC TTTAACGCAA 
ATGCAAGAAT GGGGCAAAGG AAAATTAGTC CCACATTTCG TAATTGAAAA TATTAATCCA 
GATGCAGGAA CGCCGGAAGC TCGTGCGAAT GAAGCGATTG AAACCAAACT AAATACAACC 
GTTATTTCAA TGATTCGTGC GAAAGATGAT AAAGCCTTTG ACAAATCTTT AGAAGACTAC 
AAAGCATTCT TAAAATCAAA TAAATGGGAT GCAATTGAAA AAATAAAATC TGAGAAAATG 
GCGGAAAACA GAGACAAACT TAAGTAA 

EF018-2 (SEQ ID NO: 66) 

MKKV LGGLLVATAV VSLAACSGGE 

KKASSDVSIK DRYELDEKTP AWKLDKKKEP TKIKWYINSD WTALPFGKDV 
VDIEFISGDD SKLNAMISSG DMPDIVTLTE KTGQAALKAD SWAYSLNDLA 
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE SGNIPVNDNF VIREDVYNAL 
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF LDKLQDFLGV PLEDKNGKYY 
WLKTFNDVYR AGNISDDSFT DDGATFDEKV KQGNYATMLV AGTSGQGGNF 
YIAIDGPSST SGRKPTLNQT GISGWLSNYI TKDAKDPAKV TQLFTYLIDE 
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY NKKYGISRFL YFNNDRVNKL 
MQEWGKGKLV PHFVIENINP DAGTPEARAN EAIETKLNTT VISMIRAKDD 
KAFLKSNKWD AIEKIKSEKM AENRDKLK 

EF018-3 (SEQ ID NO: 67) 

CTGTAG CGGTGGGGAA 
AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT 
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT 
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AGACTTAAAT 
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA 
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT 
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT 
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT 
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT 
GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC 



TTAQIKKDLN 
KKYDPYLMKV 
GKPDVSTPEN 
DRNLDKEYLE 
TEFMKKSGTR 
PGQILTKYGV 
KVPMESALTQ 
KAFDKSLEDY 
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ACCACAGTGG GCGATGGTGC AGGACCATTT TTAGACAAAT TACAAGACTT CTTAGGTGTT 
CCTTTAGAGG ATAAAAATGG TAAATACTAT GATCGAAATT TAGATAAAGA ATATTTAGAA 
TGGTTAAAAA CATTTAATGA TGTTTACCGA GCAGGCAATA TTAGTGATGA TAGCTTCACA 
GATGATGGGG CAACGTTTGA TGAAAAAGTG AAACAAGGAA ATTATGCAAC CATGCTCGTT 
GCTGGAACCA GTGGTCAAGG TGGGAACTTC ACAGAATTTA TGAAAAAATC TGGCACACGT 
TATATAGCCA TTGATGGACC AAGTAGCACT TCTGGCCGAA AACCAACATT AAATCAAACC 
GGCATTTCAG GTTGGTTAAG TAATTACATT ACGAAAGATG CGAAAGATCC AGCAAAAGTC 
ACTCAACTGT TCACATATTT AATTGATGAA CCGGGACAAA TTTTAACAAA ATATGGCGTT 
GAAGGAGTTA CTTATGCGTA CAATGATCAA GGAAAAATTG ATTATTTACC AGAAGTGAAA 
AAATTAGAAC AAACAGACAA TGATGCCTAC AACAAAAAAT ATGGCATTAG TCGTTTCCTA 
TACTTTAACA ACGACCGTGT CAATAAACTA AAAGTACCAA TGGAAAGTGC TTTAACGCAA 
ATGCAAGAAT GGGGCAAAGG AAAATTAGTC CCACATTTCG TAATTGAAAA TATTAATCCA 
GATGCAGGAA CGCCGGAAGC TCGTGCGAAT GAAGCGATTG AAACCAAACT AAATACAACC 
GTTATTTCAA TGATTCGTGC GAAAGATGAT AAAGCCTTTG ACAAATCTTT AGAAGACTAC 
AAAGCATTCT TAAAATCAAA TAAATGGGAT GCAATTGAAA AAATAAAATC TGAGAAAATG 
GCGGAAAACA GAGACAAACT TAAG 

EF018-4 (SEQ ID NO:68) 

CSGGE 

KKASSDVSIK DRYELDEKTP AWKLDKKKEP TKIKWYINSD WTALPFGKDV TTAQIKKDLN 
VDIEFISGDD SKLNAMISSG DMPDIVTLTE KTGQAALKAD SWAYSLNDLA KKYDPYLMKV 
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE SGNIPVNDNF VIREDVYNAL GKPDVSTPEN 
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE 
WLKTFNDVYR AGNISDDSFT DDGATFDEKV KQGNYATMLV AGTSGQGGNF TEFMKKSGTR 
YIAIDGPSST SGRKPTLNQT GISGWLSNYI TKDAKDPAKV TQLFTYLIDE PGQILTKYGV 
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY NKKYGISRFL YFNNDRVNKL KVPMESALTQ 
MQEWGKGKLV PHFVIENINP DAGTPEARAN EAIETKLNTT VISMIRAKDD KAFDKSLEDY 
KAFLKSNKWD AIEKIKSEKM AENRDKLK 



EF019-1 (SEQ ID NO:69) 

TAAAGGAGTT ACACAATGAA ACTTTTAAAA AAGACGGTCC TAATTGGTAC AACCCTTCTT 
CTTGGTTCAT TCTTACTCGC AGCTTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC 
AAAACACATG AAGTAACAGA TACCTTAGGC AATAAAGTAA CCGTCCCCGC GAAACCCAAA 
CGGATTATTG CGAGTTATTT AGAAGATTAT CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA 
CAATGGACAG TTGGACAAGG CAGCATTCAA GATTATTTAG CGAAAGAATT GAAAGATGTC 
CCCACTATTT CCTATGACTT GCCATATGAA GCGGTTCTAA AATTTGAACC TGACTTATTA 
TTAATCAGTT CATCTGCTCT AGTTGAAGGC GGTAAATACA AAGAATACAG TAAAATTGCG 
CCAACTTATG TAGTCAAAAA CGGCGAAAAT GTCACCTGGC GTGATCAATT GGAAGATATT 
GCCACTGTTT TAGATAAAAA AGAACAAGCG AAAAAAGTGT TAGAAGATTA TGATACCTTA 
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA AAAGATGCTG GCAAATCTGC GGCAGTCTTA 
TGGGTAACCA ACAACCAAGT CTTTATGGTT AGCGATAATC GCTCAAGCGG AACCGTGCTC 
TATCAGGACT TAGGCCTCCA AGTTCCAAAA TTAGTGGAAG AAATTTCTAA AAACGCTACT 
GCGGATTGGA ATCAAGTTTC TTTAGAAAAA TTAGCTGAGC TTGACGCAGA CCACATTTTC 
CTTGTAAACA GCGATGAATC AGCACCTCTT TTCCAAGAAG CAATTTGGAA GAACTTACCT 
GCTGTGAAAA ATAACCAAGT TCATACCTAT GATAAAAAAA GTAGTTGGTT ATACAACGGA 
CCTATTGCGA ATACTCAAAT TGTTGAAGAT GTAAAAAAAG CGCTCTTAAA TTAA 

EF019-2 { (SEQ ID NO:70) 

MKLLKK TVLIGTTLLL GSFLLAACGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 

I I ASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL 
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ISSSALVEGG KYKEYSKIAP TYWKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT 
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA 
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP 
IANTQIVEDV KKALLN 

EF019-3 (SEQ ID NO:71) 

TTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC 

AAAACACATG AAGTAACAGA TACCTTAGGC AATAAAGTAA CCGTCCCCGC GAAACCCAAA 
CGGATTATTG CGAGTTATTT AGAAGATTAT CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA 
CAATGGACAG TTGGACAAGG CAGCATTCAA GATTATTTAG CGAAAGAATT GAAAGATGTC 
CCCACTATTT CCTATGACTT GCCATATGAA GCGGTTCTAA AATTTGAACC TGACTTATTA 
TTAATCAGTT CATCTGCTCT AGTTGAAGGC GGTAAATACA AAGAATACAG TAAAATTGCG 
CCAACTTATG TAGTCAAAAA CGGCGAAAAT GTCACCTGGC GTGATCAATT GGAAGATATT 
GCCACTGTTT TAGATAAAAA AGAACAAGCG AAAAAAGTGT TAGAAGATTA TGATACCTTA 
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA AAAGATGCTG GCAAATCTGC GGCAGTCTTA 
TGGGTAACCA ACAACCAAGT CTTTATGGTT AGGGATAATC GCTCAAGCGG AACCGTGCTC 
TATCAGGACT TAGGCCTCCA AGTTCCAAAA TTAGTGGAAG AAATTTCTAA AAACGCTACT 
GCGGATTGGA ATCAAGTTTC TTTAGAAAAA TTAGCTGAGC TTGACGCAGA CCACATTTTC 
CTTGTAAACA GCGATGAATC AGCACCTCTT TTCCAAGAAG CAATTTGGAA GAACTTACCT 
GCTGTGAAAA ATAACCAAGT TCATACCTAT GATAAAAAAA GTAGTTGGTT ATACAACGGA 
CCTATTGCGA ATACTCAAAT TGTTGAAGAT GTAAAAAAAG CGCTCTTAAA T 



EF019-4 (SEQ ID NO:72) 

CGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 

I IASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL 
ISSSALVEGG KYKEYSKIAP TYWKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT 
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA 
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP 
IANTQIVEDV KKALLN 



EF020-1 (SEQ ID NO:73) 

TGAGGAGATG AGAAAATGAA AAAGGTAGTT TCAATTTTGT TGATGGTTGT TGCAGTCTTC 
ACATTAACTG CATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT 
ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA 
TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA 
ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA 
AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA 
GGGAAATTTA AATAA 

EF020-2 (SEQ ID NO:74) 

MKKWS ILLMWAVFT LTACNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY 
KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEWYKSG 
KFK 



EF020-3 (SEQ ID NO:75) 



ATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT 
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ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA 
TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA 
ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA 
AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA 
GGGAAATTTA AA 

EF020-4 (SEQ ID NO:76) 

CNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY 

KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEWYKSG 
KFK 



EF021-1 (SEQ ID NO:77) 

TAGTTGTTTA AATACATTAA ACTATTTTTA 
TTATTCGGTT TTAGTTTGAT TGCATTAGGT 
GGCAAAGGCA AAACCGCTGA AAGCGGCGGT 
ATCATTACAG ATACAGGCGG CGTGGATGAC 
TTGCAAGCTT GGGGTAAAGA ACATGATTTA 
CAATCGAATG ATGCAGCTGA CTATACAACC 
AACACAATCT TTGGTATTGG CTACTTGCTA 
AACCCTGATA CAAACTTTGT TTTAATCGAT 
TCTGCAACAT TTAGAGATAA TGAAGCAGCT 
ACAAAAACGA ACAAAGTCGG TTTTGTTGGT 
CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT 
GTTGATACGA AATATGCGGC TTCATTTGCT 
GCAATGTACC AAAACGGCGT TGATATCATC 
GTCTTCCAAG AAGCAAAAGA CTTGAATGAA 
GGCGTTGACC GCGATCAAGA TGCTGATGGC 
AACTTCACGT TAACTTCAAC GCTTAAAGGT 
CGTGCGTTAG AAGACAAATT CCCTGGTGGC 
GGCGTTGACT TAACAGACGG CTATTTAAAC 
AAAGATAAAG TAATCTCAGG TGACGTAAAA 



GGAGGCTTTA CAGAAATGAA AAAAGCAAAA 
TTATCAGTTT CACTTGCAGC ATGTGGTGGT 
GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA 
AAGTCGTTCA ACCAATCTTC TTGGGAAGGA 
CCAGAAGGTT CAAAAGGGTA TGCATATATT 
AATATTGACC AAGCGGTATC AAGTAAATTC 
AAAGATGCAA TTTCTTCTGC AGCAGATGCC 
GATCAAATCG ATGGCAAAAA GAATGTCGTT 
TACTTAGCCG GTGTTGCTGC TGCAAATGAA 
GGTGAAGAAG GGGTCGTAAT TGACCGTTTC 
GCTGCGAAAG AATTAGGTAA AGAAATTACT 
GATCCTGCCA AAGGGAAAGC TTTAGCTGCT 
TTCCATGCTT CTGGTGCGAC TGGACAAGGG 
TCAGGTTCTG GCGACAAAGT TTGGGTAATC 
AAGTACAAAA CAAAAGACGG CAAAGAAGAC 
GTCGGCACAG CGGTTCAAGA TATTGCCAAC 
GAACATTTAG TTTATGGATT AAAAGATGGT 
GACAAAACAA AAGAAGCTGT TAAAACAGCA 
GTCCCAGAAA AACCAGAATA A 



EF021-2 (SEQ ID NO:78) 

MKKAKL FGFSLIALGL SVSLAACGGG KGKTAESGGG KGDAAHSAVI 

ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN 
TIFGIGYLLK DAISSAADAN PDTNFVLIDD QIDGKKNWS ATFRDNEAAY LAGVAAANET 
KTNKVGFVGG EEGWIDRFQ AGFEKGVADA AKELGKEITV DTKYAASFAD PAKGKALAAA 
MYQNGVDIIF HASGATGQGV FQEAKDLNES GSGDKVWVIG VDRDQDADGK YKTKDGKEDN 
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND KTKEAVKTAK 
DKVISGDVKV PEKPE 



EF021-3 (SEQ ID NO:79) 
ATGTGGTGGT 

GGCAAAGGCA AAACCGCTGA AAGCGGCGGT 
ATCATTACAG ATACAGGCGG CGTGGATGAC 
TTGCAAGCTT GGGGTAAAGA ACATGATTTA 
CAATCGAATG ATGCAGCTGA CTATACAACC 



GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA 
AAGTCGTTCA ACCAATCTTC TTGGGAAGGA 
CCAGAAGGTT CAAAAGGGTA TGCATATATT 
AATATTGACC AAGCGGTATC AAGTAAATTC 
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AACACAATCT TTGGTATTGG CTACTTGCTA AAAGATGCAA TTTCTTCTGC AGCAGATGCC 

AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATGGCAAAAA GAATGTCGTT 

TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC TGCAAATGAA 

ACAAAAACGA ACAAAGTCGG TTTTGTTGGT GGTGAAGAAG GGGTCGTAAT TGACCGTTTC 

CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA AGAAATTACT 

GTTGATACGA AATATGCGGC TTCATTTGCT GATCCTGCCA AAGGGAAAGC TTTAGCTGCT 

GCAATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTGGTGCGAC TGGACAAGGG 

GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT TTGGGTAATC 

GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG CAAAGAAGAC 
AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA TATTGCCAAC 

CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT AAAAGATGGT 

GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT TAAAACAGCA 
AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAA 



EF021-4 (SEQ ID NO:80) 

CGGG KGKTAESGGG KGDAAHSAVI 
ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP 
TIFGIGYLLK DAISSAADAN PDTNFVLIDD 
KTNKVGFVGG EEGWIDRFQ AGFEKGVADA 
MYQNGVDIIF HASGATGQGV FQEAKDLNES 
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE 
DKVISGDVKV PEKPE 



EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN 
QIDGKKNWS ATFRDNEAAY LAGVAAANET 
AKELGKEITV DTKYAASFAD PAKGKALAAA 
GSGDKVWVIG VDRDQDADGK YKTKDGKEDN 
HLVYGLKDGG VDLTDGYLND KTKEAVKTAK 



EF022-1 (SEQ ID NO: 81) 

TAAGAGCATA AAAAAATGAA 
ACAATGGTTT GTATTTTATT 
AAAAAGAAAC AGAAAAATAC 
ACGCTCAACA CCTCTGTATT 
GAAGGGTTAT ATAGTTTAGA 
CCGATGATTT CAGAAGATGG 
AGTAACGATG ATCCTGTCAC 
CCTAAAAACG GCTTTGTTTA 
ATCTCAGCGG GGAAATTAGC 
TTAAAGGTGA CGCTCAAAGA 
TTTTTCCCGC AAAATCNAAA 
GATAAAGTCG TCTATAATGG 
TGGCAACTAG CAAAAAATAA 
AATTATACAG TTATCAAAGA 
GATGTGGCTA CACTAAGTGG 
TCGTATCCAA CAGCGACAAT 
ACGCCGCTTG CAAACGAAAA 
CTAGTCAATA ATATTATTGC 
TTTGTGGCGA ATCCCACAAC 
TATAACAAAG AAAAAGCGCA 
GTTAACGTTG AATTGATGGT 
CAAGGCTCGC TACAAGAATT 
GAAGCTGCAT TGAACTTTGG 
CCAGACTATC AAGACCCTAT 
TATCAGAACC CTGTCTATGA 
CCAGAAAAAA GATGGGCGAC 



GAGTTATAGG AGAAAGAAGA 
GGTAGGATTT TTAGCTGGGT 
CAAAGAAGCC GTTCAACTGA 
ATTGGATTTT CCAGATGCTA 
TGAACAAGAC CAATTGGTAC 
AAAAACCTAC ACGATTTCTT 
AGCACATGAT TTTGAATATG 
TAGCTTCCTC ATCGTTGAAA 
ACCCAATGAA CTAGGTGTCA 
GCCAAAACCG TACTTTACGT 
AGTAGTCGAA CAATTTGGTG 
TCCGTTCGTG GTAAAAGATT 
TCGCTATTGG GATCACCAGA 
AACATCTACC GCATTGAATC 
TGAACTGGCG CAACAGAATA 
GAACTATTTG CGCTTAAATC 
CCTGCGTAAA GCATTGGCTT 
AGATGGTTCT AAAGCGCTAC 
GGGTCTCGAT TTTCGTCAAG 
AAGTTATTGG AAAAAAGCAC 
AACAGATGAT GGTTCTTACA 
GTTTCCTGGT TTGACAATAG 
GCGAGAAAGT GACTATGATT 
TTCTACCCTG ATGACTTTAT 
CAAATTATTA GATGAAGCAG 
ACTGATTGCA GCTGAAAAAG 



TGAAAAAGTA TTTAAAAATC 
GTACCAATAA AAATGAAAAT 
TGTCACCCTC GGAATTAACA 
TTGTCCAAAC TGCAGCGTTT 
CAGCCGTAGC AAAAGCATTG 
TGAGAAAAGA AGCGGTTTGG 
CTTGGAAAAA AATGATTGAT 
CAATTCAAAA TGGTGCAGAA 
CAGCTGTGGA TGATTATACA 
CCTTGTTAGC TTTTCCGACA 
CGGACTATGG AACTGCTAGT 
GGCAGCAAAC AAAGATGGAC 
ACGTGCGCTC AGACATTATC 
TTTTTGAAGA TGGACAATTA 
AAAATAATAC GTTGTATCAT 
AAAAACGGNA AGGGCAAGCN 
TAGGAATAGA TAAAGAAAAT 
ATGGTGCGAT TACGGAAGGC 
AAGCAGGTAA TTTAATGGTT 
AAGCAGAATT AGGAGAAAAG 
AAAAAATTGG TGAAAGTTTG 
AGCTAACCGC ATTGCCGACT 
TATTCTTAAT TTACTGGACA 
ACAAGGGCAA TGATCGCAAT 
CCACAACCTA TGCCTTAGAG 
AAGTGATTGA AACGACTGCT 
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GGCATGATTC CACTTAGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AGTCAAAGGC 
TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAATAA 



EF022-2 (SEQ ID NO:82) 

MKKYLKIT MVCILLVGFL AGCTNKNENK KKQKNTKEAV QLMSPSELTT 
LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP MISEDGKTYT ISLRKEAVWS 
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 
KVTLKEPKPY FTSLLAFPTF FPQNXKWEQ FGADYGTASD KWYNGPFW KDWQQTKMDW 
QLAKNNRYWD HQNVRSDIIN YTVIKETSTA LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
YPTATMNYLR LNQKRXGQAT PLANENLRKA LALGIDKENL VNNIIADGSK ALHGAITEGF 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
GSLQELFPGL TIELTALPTE AALNFGRESD YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
QNPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 
NFHTFGAPLT LKNVYKEK 



EF022-3 (SEQ ID NO: 83) 

GT GTACCAATAA AAATGAAAAT 
AAAAAGAAAC AGAAAAATAC CAAAGAAGCC 
ACGCTCAACA CCTCTGTATT ATTGGATTTT 
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC 
CCGATGATTT CAGAAGATGG AAAAACCTAC 
AGTAACGATG ATCCTGTCAC AGCACATGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG 
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA 
GATAAAGTCG TCTATAATGG TCCGTTCGTG 
TGGCAACTAG CAAAAAATAA TCGCTATTGG 
AATTATACAG TTATCAAAGA AACATCTACC 
GATGTGGCTA CACTAAGTGG TGAACTGGCG 
TCGTATCCAA CAGCGACAAT GAACTATTTG 
ACGCCGCTTG CAAACGAAAA CCTGCGTAAA 
CTAGTCAATA ATATTATTGC AGATGGTTCT 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT 
TATAACAAAG AAAAAGCGCA AAGTTATTGG 
GTTAACGTTG AATTGATGGT AACAGATGAT 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT 
CCAGACTATC AAGACCCTAT TTCTACCCTG 
TATCAGAACC CTGTCTATGA CAAATTATTA 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA 
GGCATGATTC CACTTAGCCA AAATGAACAA 
TTGAATTTTC ATACCTTTGG CGCTCCATTA 



GTTCAACTGA TGTCACCCTC GG AATTAAC A 
CCAGATGCTA TTGTCCAAAC TGCAGCGTTT 
CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
ACGATTTCTT TGAGAAAAGA AGCGGTTTGG 
TTTGAATATG CTTGGAAAAA AATGATTGAT 
ATCGTTGAAA CAATTCAAAA TGGTGCAGAA 
CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TACTTTACGT CCTTGTTAGC TTTTCCGACA 
CAATTTGGTG CGGACTATGG AACTGCTAGT 
GTAAAAGATT GGCAGCAAAC AAAGATGGAC 
GATCACCAGA ACGTGCGCTC AGACATTATC 
GCATTGAATC TTTTTGAAGA TGGACAATTA 
CAACAGAATA AAAATAATAC GTTGTATCAT 
CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTCGTCAAG AAGCAGGTAA TTTAATGGTT 
AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
TTGACAATAG AGCTAACCGC ATTGCCGACT 
GACTATGATT TATTCTTAAT TTACTGGACA 
ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
GATGAAGCAG CCACAACCTA TGCCTTAGAG 
GCTGAAAAAG AAGTGATTGA AACGACTGCT 
ACAGTCCTGC AAAATGATAA AGTCAAAGGC 
ACGTTAAAAA ATGTTTATAA GGAAAAA 



EF022-4 <SEQ ID NO: 84) 
CTNKNENK KKQKNTKEAV QLMSPSELTT 

LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP MISEDGKTYT ISLRKEAVWS 
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 
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KVTLKEPKPY FTSLLAFPTF FPQNXKWEQ 
QLAKNNRYWD HQNVRSDIIN YTVIKETSTA 
YPTATMNYLR LNQKRXGQAT PLANENLRKA 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK 
GSLQELFPGL TIELTALPTE AALNFGRESD 
QNPVYDKLLD EAATTYALEP EKRWATLIAA 
NFHTFGAPLT LKNVYKEK 

EF023-1 (SEQ ID NO: 85) 



FGADYGTASD KWYNGPFW KDWQQTKMDW 
LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
LALGIDKENL VNNIIADGSK ALHGAITEGF 
KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 



TAAAATGGAG GGATCGGTAT GAAGAAATTA AAAATGTTAG GATGCGTCGG GTTGCTTTTA 
GCTTTAACGG CTTGTCAGGC GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA 
CAAAAAATTG CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 
GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 
GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA 
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA 
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 
CAATTGGAAG AAAATCTAGA TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAACTAA 



EF023-2 (SEQ ID NO:86) 

MKKLK MLGCVGLLLA LTACQAGTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 
TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDWNKLSG EFIPGYVDNP AFLSIPQFVT 
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LYWHSFGPTY SLKWAYVN 
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EF023-3 (SEQ ID NO: 87) 

GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA 

CAAAAAATTG CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 
GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 
GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA 
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA 
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 
CAATTGGAAG AAAATC TAG A TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATC TGGTTG GGGAACCGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAAC 



EF023-4 {SEQ ID NO:88) 

GTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 

TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFR VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDWNKLSG EFIPGYVDNP AFLSIPQFVT 
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LYWHSFGPTY SLKWAYVN 

EF024-1 (SEQ ID NO:89) 

TAATGGCCGT TTCGTCTACT AATAAAGAGG ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG TACTACCTTT TATTGCCTTA 
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAATAA 
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EF024-2 (SEQ ID NO: 90) 

M KKVLPFIALV GLLLLSGCGT DMKKILTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 



EF024-3 (SEQ ID NO:91) 
ATT GACTGCCGAT 

GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAA 



EF024-4 {SEQ ID NO: 92) 



LTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 



EF025-1 (SEQ ID NO: 93) 

TGAATGAAAC ATATTAAAGG AATGTTGGTT 
GCGC CAGATC AAGAGCCAAC GAAACAAACA 
AAGCAAGTTA CCGTCACCAA TCAAACGACT 
AATGACGAAC TGATTGCTAA TCAATTGACT 
GTTACAGGGG CCACACAAAC GACATTTGGA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT 
TATTATAAAA ATGAAGGTGT ATTTACTGGC 
GAACCTGAAA CGCAAAGGAT TCTGAATGTT 
TATGATACAC GCTATTCGGG TGTCAACAAA 
AGCAACACGC GTACAGACGA TACGTTAGTC 
AAACAAATGC GTGACGAAAA TCGTGTTACA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA 
CC ATCGAAAG AAACGTATAT CGGTTACGCA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA 
GATGAACAGG AAAAAATCAC AGAAACAGCC 
ATTCACCAGG ATACAATAAA CAAACCAACA 



TTTATCGGAT TATTTATTTT GGTTGGTTGT 
ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG 
TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
TTTGATTCTC ATGAATACAC GTACGAAGTG 
ACAACCCCAC CAGCAAAATA TACACCGGAA 
CAACCGCCTT TGGGATTAAT GACGGGTAAC 
GGAAATTACG GCATTGTAGA GATTATTACG 
GAGTTTACAG AGTTTGCTAG TGATCCTTAT 
CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
GGTAATTTTT ATACGGTACG CGGTTCATCA 
GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
GAAGATTTAG GCAATGGCCT AATCGCTCGA 
AAACATGTCA GCTATGATGA ATACTTTTCA 
TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCTTTTAT TCATTTTGTA G 



EF025-2 (SEQ ID NO: 94) 



MKHIKGMLVF IGLFILVGCA 
DELIANQLTF DSHEYTYEW 
YKNEGVFTGG NYGIVEIITE 
NTRTDDTLVT WNGITYVEK 
SKETYIGYAE DLGNGLIARL 
HQDTINKPTI LLFIL 



PDQEPTKQTT SGPQETKQVK 
TGATQTTFGT TPPAKYTPEE 
PETQRILNVE FTEFASDPYY 
QMRDENRVTG NFYTVRGSST 
QVITEEQKIK HVSYDEYFSD 



QVTVTNQTTS AVEKQAPTKN 
KKKKMFWSNQ PPLGLMTGNY 
DTRYSGVNKR LSDYPEFQAS 
SAREGLMPLA AEMDTWLKEP 
EQEKITETAC GLFIVNRNII 



EF025-3 (SEQ ID NO:95) 
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AAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG 

AAGCAAGTTA CCGTCACCAA TCAAACGACT TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
AATGACGAAC TGATTGCTAA TCAATTGACT TTTGATTCTC ATGAATACAC GTACGAAGTG 
GTTACAGGGG CCACACAAAC GACATTTGGA ACAACCCCAC CAGCAAAATA TACACCGGAA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT CAACCGCCTT TGGGATTAAT GACGGGTAAC 
TATTATAAAA ATGAAGGTGT ATTTACTGGC GGAAATTACG GCATTGTAGA GATTATTACG 
GAACCTGAAA CGCAAAGGAT TCTGAATGTT GAGTTTACAG AGTTTGCTAG TGATCCTTAT 
TATGATACAC GCTATTCGGG TGTCAACAAA CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
AGCAACACGC GTACAGACGA TACGTTAGTC ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
AAACAAATGC GTGACGAAAA TCGTGTTACA GGTAATTTTT ATACGGTACG CGGTTCATCA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
CCATCGAAAG AAACGTATAT CGGTTACGCA GAAGATTTAG GCAATGGCCT AATCGCTCGA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA AAACATGTCA GCTATGATGA ATACTTTTCA 
GATGAACAGG AAAAAATCAC AGAAACAGCC TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCACCAGG ATACAATAAA CAAACCAACA ATTCTTTTAT TCATTTTG 

EF025-4 (SEQ ID NO:96) 

TKQTT SGPQETKQVK QVTVTNQTTS AVEKQAPTKN 

DELIANQLTF DSHEYTYEW TGATQTTFGT TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY 
YKNEGVFTGG NYGIVEIITE PETQRILNVE FTEFASDPYY DTRYSGVNKR LSDYPEFQAS 
NTRTDDTLVT WNGITYVEK QMRDENRVTG NFYTVRGSST SAREGLMPLA AEMDTWLKEP 
SKETYIGYAE DLGNGL I ARL QVITEEQKIK HVSYDEYFSD EQEKITETAC GLFIVNRNII 
HQDTINKPTI LLFIL 



EF026-1 (SEQ ID NO:97) 

TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA 

TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT 

GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT 

AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 

CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 

AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 

TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 

CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 

AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CC AC TGTTAC TGGTGAAGTC ACCGTAGCCA ATAATTAA 

EF026-2 (SEQ ID NO: 98) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 

NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 

EF026-3 (SEQ ID NO: 99) 

AACAGATAG TAGTTCTAGT 

AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 

CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 

AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
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TCCAACGATG TCTATCGTAA 
GCTGAATATG AAATCACGGT 
CACGGGACTG TCAAAGGTGA 
AAAGTTAATG GCAATATTAC 
AAAGATGGTG CCACTGTTAC 



ATTAGCACTT TATTCCCAAG 
TCCTAAGCTA ATCGTTTCTT 
TATTGAGGTG AAAGCAAATG 
TTTTGATAAA CAAGAATACA 
TGGTGAAGTC ACCGTAGCCA 



ATGATAATAA AAAAGTAACT 
CTGAAAATTT CAACATCGTT 
GCTTTACTTT AAATGGTACC 
AAGATTCTGC TG AC TTAGAA 
ATAAT 



EF026-4 (SEQ ID NO: 100) 



TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 



EF027-1 (SEQ ID NO: 101) 



TTTGGTATGA AACAGAAAAA 
GCATGTGGAA GTGGCGGTTC 
GTCGCATCTG GTGGTGAACT 
TCCGATATGA TTGGTCAAGT 
GAGCTAGCTA TGGCGAAAGC 
AAGTTACGAG AAGCAAAATG 
GCGTTTAGAA ACGTGGTCGA 
TTTAAAAATG GGCGTGCGGT 
GCAATCGATG ACCAGACACT 
GTCTTGGTTG GGACACCTTT 
GCCTATGGGA CTTCTGCAGA 
GATGGCAATT CCGAAACTTG 
GTAAAATTGA ATGAAATTGA 
TTTGATAATG GCGACTTAGA 
GAGTCAAAAC AAGCGCATTT 
CGCCGTGAAA TTACCGGCAA 
GAAACTTTTG CAAAAGAAAT 
GCTAATTTTG CAAAAATCCA 
TGCCATATAA TATTAAAGAA 



GTGGTTAATC GGACTTGTTG 
GAAAACGACC TCAAACGAAC 
CTCGACATTA GACAGCGCTC 
AGTTGAAGGC TTGTATCGAC 
AGAGCCACAA GTTAGTGAAG 
GACAAACGGG GATCCAGTTA 
TCCAGCATAC GGTTCAAGTA 
GCGGGAAGGA CAAGCCACGA 
AGAACTAACA TTGGAAAATC 
TATGCCTAAA AATGAAGCCT 
TAATTTTGTT GGCAATGGGC 
GAAATTGAAG AAGAATGATC 
TGTTCAAGTA GTGAAAGAAA 
TTACACTGTT TTAGCAGATA 
TGTACCTAAA GCCATGGTGG 
CGAACATGTT CGAAAAGCTT 
TTTAGGAGAT GGCTCGACAG 
GATACAGGTG AAGATTTCCG 
GCCCAAGCTA ACTGGAACAA 



CACTGGGCTT GGTTTTAGCA 
CAGCTACACA GAAAATTAAC 
ATTATACAGA TGTCTATAGT 
AAGATAAAAA CGGAGATCCT 
ACGGGTTAGT CTATACATTC 
AAGCAGGGGA TTTTGTAGTT 
GCAGTAATCA AATGGATATT 
TGGAAGAATT TGGTGTCAAA 
CAATTCCTTA TTTAGCCCAA 
TTGCCAAAGA AAAAGGTACT 
CGTTTGTAAT TTCAGGTTGG 
ATTATTGGGA TAAAGAACAC 
TTGGCACAGG AGCCAATCTT 
CTTATGCACT TCAGTATAAA 
GTTATTTAAG CCCCAATCAT 
TTTTACAAGC GATTGACAAA 
CTTTAAATGG NTTTGTACCA 
CAAAGAAAAT GGTGATTTAT 
TT 



EF027-2 (SEQ ID NO: 102) 



MKQKKWLI GLVALGLVLA ACGSGGSKTT SNEPATQKIN VASGGELSTL DSAHYTDVYS 
SDMIGQWEG LYRQDKNGDP ELAMAKAEPQ VSEDGLVYTF KLREAKWTNG DPVKAGDFW 
AFRNWDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENPIPYLAQ 
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWKLK KNDHYWDKEH 
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAHFVPK AMVGYLSPNH 
RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAKKMVIY 
CHIILKKPKL TGTI 

EF027-3 (SEQ ID NO:103) 



AACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC 

GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC ATTATACAGA TGTCTATAGT 
TCCGATATGA TTGGTCAAGT AGTTGAAGGC TTGTATCGAC AAGATAAAAA CGGAGATCCT 
GAGCTAGCTA TGGCGAAAGC AGAGCCACAA GTTAGTGAAG ACGGGTTAGT CTATACATTC 
AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 
GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA GCAGTAATCA AATGGATATT 
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TTTAAAAATG GGCGTGCGGT GCGGGAAGGA 
GCAATCGATG AC C AG AC ACT AGAACTAACA 
GTCTTGGTTG GGACACCTTT TATGCCTAAA 
GCCTATGGGA CTTCTGCAGA TAATTTTGTT 
GATGGCAATT CCGAAACTTG GAAATTGAAG 
GTAAAATTGA ATGAAATTGA TGTTCAAGTA 
TTTGATAATG GCGACTTAGA TTACACTGTT 
GAGTCAAAAC AAGCGCATTT TGTACCTAAA 
CGCCGTGAAA TTACCGGCAA CGAACATGTT 
GAAACTTTTG CAAAAGAAAT TTTAGGAGAT 
GCTAATTTTG CAAAAATCCA GATACAGGTG 
TGCCATATAA TATTAAAGAA GCCCAAGCTA 



CAAGCCACGA TGGAAGAATT TGGTGTCAAA 
TTGGAAAATC CAATTCCTTA TTTAGCCCAA 
AATGAAGCCT TTGCCAAAGA AAAAGGTACT 
GGCAATGGGC CGTTTGTAAT TTCAGGTTGG 
AAGAATGATC ATTATTGGGA TAAAGAACAC 
GTGAAAGAAA TTGGCACAGG AGCCAATCTT 
TTAGCAGATA CTTATGCACT TCAGTATAAA 
GCCATGGTGG GTTATTTAAG CCCCAATCAT 
CGAAAAGCTT TTTTACAAGC GATTGACAAA 
GGCTCGACAG CTTTAAATGG NTTTGTACCA 
AAGATTTCCG CAAAGAAAAT GGTGATTTAT 
A 



EF027-4 (SEQ ID NO: 104) 



TT SNEPATQKIN VASGGELSTL DSAHYTD 1 
SDMIGQWEG LYRQDKNGDP ELAMAKAEPQ 
AFRNWDPAY GSSSSNQMDI FKNGRAVREG 
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV 
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV 
RREITGNEHV RKAFLQAIDK ETFAKEILGD 
CHIILKKPKL 



VSEDGLVYTF KLREAKWTNG DPVKAGDFW 
QATMEEFGVK AIDDQTLELT LENPIPYLAQ 
GNGPFVISGW DGNSETWKLK KNDHYWDKEH 
LADTYALQYK ESKQAHFVPK AMVGYLSPNH 
GSTALNGFVP ANFAKIQIQV KISAKKMVIY 



EF028-1 (SEQ ID NO:105) 



TAACAGAAGC 
AAGACTTGTT 
AGAGCTTTGC 
TCTGAACAGA 
GAAAAAGCAT 
ACAACGGGCT 
TTTGATACCT 
ACCGATTCAG 
ATTGCACTCG 
GGGAAATCAA 
GGCGCACATA 
CAAATCGACG 
CGGAAAGATC 
AAAAAGTCGT 
CTACCTAAAA 
GCTCTTCAAC 
ATTGATTGGG 
GAAGCGGCGT 
CAACTGCAGA 



AATACAACAA 
ATAGTCAATG 
TAGGGGTTAC 
AAAGCGGCGA 
CAGTAAAAAA 
ATCGCTATTT 
ATTTGGTCGG 
CTTCCGCAGC 
ATAATGACAA 
CGGGTCTTGT 
ATGTTTCACG 
GACAACACAA 
GTGATTTAGT 
TAAATGAGAA 
TGATTGACCG 
GGTTAGATAA 
CCGGGCATAG 
TTGAAAAGGC 
TCATTCAACA 



CTTAACACTT 
TATGGGTAGA 
CTTATTAACA 
AAAACAAACA 
TGTTATTTTT 
CAAAGCCAAT 
ACAGCAAGCC 
GACAGCGATG 
GTCCAAAACA 
AGCAACATCT 
CAAAAATATG 
AGTCGATGTG 
CAAAGAATTT 
CCAAGACGAC 
AACGGAAGAA 
AAATGAAAAA 
CAATGATATT 
CATCGATTTT 
GGGGGCTTGT 



TGTTTACTTG 
TATGAAGGAG 
TTCACAACAT 
GAGGTTGCTG 
ATGATTGGAG 
CACTCAGACA 
ACTTATCCAG 
GCTGCCGGAG 
GAAACAGTGC 
GAAATAACAC 
GCAGAAATCG 
TTACTTGGCG 
TCCCAAGCGG 
AAAATTTTAG 
GTCCCTTCAT 
GGTTTCTTTT 
GTTGGCGCGA 
GCCAAAAAAG 
CTTTAG 



TTATTTATCA 
GAAACAAGGA 
TAGCGGGTTG 
AAGCGAAGGC 
ATGGCATGGG 
AGCGTGTTCC 
AAGATGAAGA 
TGAAAACCTA 
TCGAACGTGC 
ATGCAACCCC 
CCGATGACTA 
GCGGCTCCGA 
GTTATGGTCA 
GCTTGTTTGC 
TAGCTGATAT 
TAATGGTTGA 
TGAGCGAAAT 
ATGGTGAACA 



GAAATCAACT 
AATGAAGAAA 
TACAAATTTA 
AACTGAATCT 
GAATCCGTAT 
CCAAACAGCT 
AGAGAATGTC 
TAATAATGCT 
GAAAAAAGTG 
TGCTGCATAT 
TTTTGATGAT 
ATTATTTGCC 
TGTCACAGAC 
ACCAGGCGGG 
GACAGAAGCG 
AGGTAGTCAA 
GCAAGACTTC 
TTGGTGGTTA 



EF028-2 (SEQ ID NO: 106) 



MKKR ALLGVTLLTF TTLAGCTNLS 
EQKSGEKQTE VAEAKATESE KASVKNVIFM 
DTYLVGQQAT YPEDEEENVT DSASAATAMA 
KSTGLVATSE ITHATPAAYG AHNVSRKNMA 
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV 



IGDGMGNPYT TGYRYFKANH SDKRVPQTAF 
AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
ILGLFAPGGL PKMIDRTEEV PSLADMTEAA 
GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 
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LQIIQQGACL 

EF028-3 (SEQ ID NO: 107) 

ACAGA AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT 
GAAAAAGCAT CAGTAAAAAA TGTTATTTTT ATGATTGGAG ATGGCATGGG GAATCCGTAT 
ACAACGGGCT ATCGCTATTT CAAAGCCAAT CACTCAGACA AGCGTGTTCC CCAAACAGCT 
TTTGATACCT ATTTGGTCGG ACAGCAAGCC ACTTATCCAG AAGATGAAGA AGAGAATGTC 
ACCGATTCAG CTTCCGCAGC GACAGCGATG GCTGCCGGAG TGAAAACCTA TAATAATGCT 
ATTGCACTCG ATAATGACAA GTCCAAAACA GAAACAGTGC TCGAACGTGC GAAAAAAGTG 
GGGAAATCAA CGGGTCTTGT AGCAACATCT GAAATAACAC ATGCAACCCC TGCTGCATAT 
GGCGCACATA ATGTTTCACG CAAAAATATG GCAGAAATCG CCGATGACTA TTTTGATGAT 
CAAATCGACG GACAACACAA AGTCGATGTG TTACTTGGCG GCGGCTCCGA ATTATTTGCC 
CGGAAAGATC GTGATTTAGT CAAAGAATTT TCCCAAGCGG GTTATGGTCA TGTCACAGAC 
AAAAAGTCGT TAAATGAGAA CCAAGACGAC AAAATTTTAG GCTTGTTTGC ACCAGGCGGG 
CTACCTAAAA TGATTGACCG AACGGAAGAA GTCCCTTCAT TAGCTGATAT GACAGAAGCG 
GCTCTTCAAC GGTTAGATAA AAATGAAAAA GGTTTCTTTT TAATGGTTGA AGGTAGTCAA 
ATTGATTGGG CCGGGCATAG CAATGATATT GTTGGCGCGA TGAGCGAAAT GCAAGACTTC 
GAAGCGGCGT TTGAAAAGGC CATCGATTTT GCCAAAAAAG ATGGTGAACA TTGGTGGTTA 
CAACTGCAGA TCATTCAACA GGGGGCTTGT CTT 

EF028-4 (SEQ ID NO: 108) 

QKSGEKQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF 
DTYLVGQQAT YPEDEEENVT DSASAATAMA AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
KSTGLVATSE ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK ILGLFAPGGL PKMIDRTEEV PSLADMTEAA 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 
LQIIQQGACL 

EF029-1 (SEQ ID NO:109) 

TGAAGGAGGG AGAAAATGAA AAAGTTAATC GGTAAAAAGT GGCTGCTGCT TACAGCAGTA 
GCCACTTTTT TATTATCAGG ATGCGCAAGT CTTGAACAAA AAGCACAGGA TAGTGTAAAA 
GAAGTTACTG AAAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 
GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA ACAAAAAGTT 
CGGTTTTTAT TAATTGACAC ACCCGAGACT GTGAAACCGA AAACAAAAGT TCAGCCGTTC 
GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA CTGCTTCAGA AATTACGTTT 
GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG CGTTGGGCTA CATATTCGTA 
GATGGAACAT TACTACAAAA AACGCTTGTA AGTGAAGGAT TAGCTCGTGT TGCCTATGTA 
AAAGAGCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG CCCAAGAACA GGCTAAAAAT 
GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC AACGGGGGTT TAGTAAATAA 

EF029-2 (SEQ ID NO: 110) 

MKKLIG KKWLLLTAVA TFLLSGCASL EQKAQDSVKE VTENVTQTIS NDQRIPADFV 
RHVDGDTTVL KIDGKEQKVR FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE 
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 
SLGIWSIPGY VTQRGFSK 

EF029-3 (SEQ ID NO: 111) 

AAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 

GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA ACAAAAAGTT 
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CGGTTTTTAT TAATTGACAC ACCCGAGACT GTGAAACCGA AAACAAAAGT TCAGCCGTTC 
GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA CTGCTTCAGA AATTACGTTT 
GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG CGTTGGGCTA CATATTCGTA 
GATGGAACAT TACTACAAAA AACGCTTGTA AGTGAAGGAT TAGCTCGTGT TGCCTATGTA 
AAAGAGCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG CCCAAGAACA GGCTAAAAAT 
GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC AACGGGGGTT TAGTAAA 



EF029-4 (SEQ ID NO: 112) 

NVTQTIS NDQRIPADFV 
RHVDGDTTVL KIDGKEQKVR FLLIDTPETV 
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS 
SLGIWSIPGY VTQRGFSK 

EF030-1 (SEQ ID NO:113) 



KPKTKVQPFG LEASKRTKEL LSTASEITFE 
EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 



TGATTGACAC ATAGGGGGAA TAGTATGAAA AAGTTAAAAA TGATGGGGAT TATGTTATTT 
GTTAGTACGG TCTTGGTAGG TTGTGGCACA ACAGCAGANA CAAAAATAGA CGAGAAAGCA 
ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TGGAGAACTC GGAAATCGGT 
TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATGTCTTT 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 
CCAGAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG 
TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 
CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 
GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT 
TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTGTATATA ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT GATAGTGACC GA 

EF030-2 (SEQ ID NO:114) 

MKK LKMMGIMLFV STVLVGCGTT AXTKIDEKAT EKTSVSKKVL NLMENSEIGS 

MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS 

NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL 

EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 

DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA 

IERSKVYSLR LNQKRNEKPS IFANENVRKA L.AYALDKKSL VDNILADGSK EIYGYIPEKF 

VYNPETNEDF RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDGD SDR 

EF030-3 (SEQ ID NO:115) 

GAGAAAGCA 

ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TGGAGAACTC GGAAATCGGT 
TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATGTCTTT 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 
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CCAGAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG 
TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 
CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 
GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT 
TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTGTATATA ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT 

EF030-4 (SEQ ID NO:116) 



EKAT EKTSVSKKVL NLMENSEIGS 

MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ 

NGDAVTANDF VFAWRKLANP KNQANYFFLL 

EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE 

DFVRNPYYYD KEKVKSETIH FEVLKETNTV 

IERSKVYSLR LNQKRNEKPS IFANENVRKA 

VYNPETNEDF RQEAGALVKT DAKKAKEYLD 

EF031-1 (SEQ ID NO: 117) 



LIPAAAKEMP EISEDGKRYT IKLREDGKWS 
EGTILNGTAI TKEEKAPEEL GVKALDDYTL 
KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 
YNLYESGELD VAVLTGDFAK QNRDNPDYEA 
LAYALDKKSL VDNILADGSK EIYGYIPEKF 
KAKAELNGDV AIELLSRDG 



TGAGAAATTA GTTATTTTAG AAAAATAAAA ACCATTTTGG AGGAAGATTT AAAAATGAAA 
AAACGCGTAA TTTTAGGGAC ATTAGTCGCT GCAACGTTAT TAATGACTGC TTGTGGAAAC 
AGCGAAGCAA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT 
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG 
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC 
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAACG TGA 

EFO031-2 (SEQ ID NO: 118) 

MKK RVILGTLVAA TLLMTACGNS EATTKSESKG GSNALWSTF 

GLSEDIVKKD I I APFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK 
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS 
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN WKTYSKSSD 
LANMFQSGEI EAAWADFAV DIIQGAQKT 

EF031-3 (SEQ ID NO:119) 
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AA CTACGAAAAG C GAG AG C AAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT 
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG 
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC 
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAA 

EF031-4 (SEQ ID NO:120) 

TTKSESKG GSNALWSTF 

GLSEDIVKKD I I APFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK 
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS 
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN WKTYSKSSD 
LANMFQSGEI EAAWADFAV DIIQGAQK 



EF032-1 (SEQ ID NO:121) 

TGAATAAATT ATTTAGGAGG AATTATGATG 
GTTTGTGGTA TTTCACTACT TACTGCTTGT 
AAGTCAACCA GTCAATCTAG CAGCACAGTT 
TCAGGGGAAT ATTCAGTTGG AAAAGATATT 
CAACTAGATG ATAAATCGAG CATAGTTCTT 
AACCATGACT TATACGGAGT GGGAAACAAG 
CTCACATTCG AAACTGCCGA CAAAGATTTT 
CAAGAATATA TGAAAAATCC AGTATCNAGT 
TCTGATGTTT CTAAAAGTAG TAGCCAAGAT 
GAAGTAAGTA CTGAAGCGAA GTCTGATGTA 
AATACTAATG ACATTACTAA GCTAGCAGAT 
GATACTTTAG CTAAGCATCA ATTTAATGAT 
TCAATTATCG GCGTCATCCC AACCATGGAC 



AAAAAATTAA TTAGTTTAGG ATTGGTTTGT 
NCGGGAAATA ATGATAATAA AGATACTGAA 
AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
GATCCTGGAG ATTACTATGC TGTATTAACT 
ATTACCGTCA AATCAGGCGG AGAAAATAGT 
AAAAAAGTAT CTCTTAAAAA GGGAGATACT 
GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
ACTGAAACTA GCAAACANAA AACAGTAAAC 
AATAAACAAT CTGATGTATC TGAAAAAAAA 
GCTACTAATA CTTTACCGAG CGAAGATAAA 
GAGCCAACCT TAGAACAACA AACCGTCTTA 
ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
GCAAAAAGAT GGTAA 



EF032-2 (SEQ ID NO:122) 

MK KLISLGLVCV CGISLLTACX GNNDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS 
IIGVIPTMDA KRW 



EF032-3 (SEQ ID NO: 123) 
TA ATGATAATAA AGATACTGAA 

AAGTCAACCA GTCAATCTAG CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
TCAGGGGAAT ATTCAGTTGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT 
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CAACTAGATG ATAAATCGAG CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT 

AACCATGACT TATACGGAGT GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT 

CTCACATTCG AAACTGCCGA CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT 

CAAGAATATA TGAAAAATCC AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC 

TCTGATGTTT CTAAAAGTAG TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA 

GAAGTAAGTA CTGAAGCGAA GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA 

AATACTAATG ACATTACTAA GCTAGCAGAT GAGCCAACCT TAGAACAACA AACCGTCTTA 

GATACTTTAG CTAAGCATCA ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT 

TCAATTATCG GCGTCATCCC AACCATGGAC GCAAAAAGAT GG 



EF032-4 (SEQ ID NO: 124) 

NDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE 
IIGVIPTMDA KRW 



TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS 



EF033-1 (SEQ ID NO:125) 

TGACTGCTTT TTTTCTATTG GAGAAAAAAG TGGTTTTTTT GTATTGTTTT GACGTTGAGA 
CAAAGGAGGT TCATTTCAGA AAATTTTCCC CAAAATAAAA TAGACGAATG CGAGGATGAA 
AAAATGAAAA AATTTACTTT AACAATGATG ACTTTAGGTT TAGTAGCAAC ACTTGGCTTA 
GCAGGATGTG GTAAACAGGA AAAGAAAGCA ACTACCTCTT CTGAAAAAAC AGAAGTAACG 
TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT 
ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA 
ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG 
GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG 
AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 
ACAGTCGTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA 
GAAATCGACG AGTAG 



EF033-2 (SEQ ID NO:126) 
MKKFTLTMMT LGLVATLGLA 

GCGKQEKKAT TSSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI 
AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT 
WNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE 

EF033-3 (SEQ ID NO: 127) 

CTCTT CTGAAAAAAC AGAAGTAACG 

TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT 
ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA 
ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG 
GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG 
AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 
ACAGTCGTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA 
GAAATCGACG AGTAG 
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EF033-4 (SEQ ID NO: 128) 



SSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI 

AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT 

WNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE 



EF034-1 (SEQ ID NO:129) 



TAGGAGGGAG TAATCATGAA 
TTGGTAGGTT GTAGTAATAA 
CCTTTAATAC TCACCACGAT 
TTCAACAAGG ATAAAACCAT 
GACACAAAAA AAACAAGTAG 
AATAAAGAGA GCTATGAAAT 
AAAAAAGTTG ATGAAGGTAA 
GGTGGTAAAT AG 



AAAAATCGGG TATTTTAGTT 
CAAAAAAGAA AACGGCAATC 
TATTGAAAAA GAAGAAGACC 
GACGCTTGAA AAAGAATATT 
AACAGAAAAA AAGGTATATA 
TATAGGTCAA TTGGACAAAA 
ACGTATATCT GATGCAGAAG 



GTATTATTTT TTTCATGTTT 
TTTTGAATGC CAGTTCGTTT 
TAACGAAAGG TTCAATTTTT 
TAGTTAATCC CAATAATGAA 
AAAATATTAA AATACAAGAA 
AAACGAAAAA AATAGAGTTT 
GTAATGTGTA TGGTGATTTT 



EF034-2 (SEQ ID NO:130) 



MKKIGY FSCIIFFMFL VGCSNNKKEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF 
NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 



EF034-3 (SEQ ID NO: 131) 



AGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT 

CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 
TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT TAGTTAATCC CAATAATGAA 
GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 
AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 
AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-4 (SEQ ID NO:132) 



KEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF 

NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 



EF035-1 (SEQ ID NO:133) 



TAAACGAGAG GTGAGTTTAT GAAAACAAAA 
TTATTCACAA GTTTCCTTTT ACTGAGTGGT 
ACAATTGATC GACAGAAAGA AAAAGTCGAT 
GAAAATTCCA TGGAAAGTTA CGACGAAAAA 
AAAATCGATA CTACTGAGTA A 



ATCGGAAAAA CAGTTATCTT GTCAGCATTT 
TGTACCTCGG CTGGCGAAGA GATGGAAAAA 
AAAACGGTCG ATAAGCAGAA ACATAAAAAT 
GTTGACCGTT CTTTAGATAG TCAAGAAGAC 



EF035-2 (SEQ ID NO: 134) 

MKTKI GKTVILSAFL FTSFLLLSGC TSAGEEMEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 
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EF035-3 (SEQ ID NO:135) 
GATGGAAAAA 

ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA ACATAAAAAT 
GAAAATTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG TCAAGAAGAC 
AAAATCGATA CTACTGAG 

EF035-4 (SEQ ID NO:136) 



MEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 



EF036-1 (SEQ ID NO: 137) 

TAATTTTCAA GTCCTACATA TAATGGTAAA ATAGAATGGA TTGAAATTAA TTGGAGGAAT 
AATGAATCGA TGAAAAAAAG ATTGCTATTA TTTATTGGTT TGGCAAGTAT ACTTACTTTG 
ACAGGATGTG CAAAATGGAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 
TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 
GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT 
AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA 
GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT 
TGGCAAGGAA ATGTCATTAA ATAA 

EF-36-2 (SEQ ID NO:138) 

MKKRLLLF IGLASILTLT GCAKWIDRGE SITAVGSSAL 

QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI 
DHKVAWGIT PIVNKNVGVK DISMENLKKI FLGEVTNWKE LGGKDQKIVI LNRAAGSGTR 
ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA FSYVTDEVAT LSIDGVQPTD 
ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP VSKMEIERDW 
QGNVIK 

EF036-3 (SEQ ID NO: 139) 

GAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 

TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 
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GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 

GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT 

AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA 

GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT 

TGGCAAGGAA ATGTCATTAA A 



EF036-4 (SEQ ID NO:140) 
IDRGE SITAVGSSAL 

QPLVETASEE YQSQNPGRFI NVQGGGSGTG 
DHKVAWGIT PIVNKNVGVK DISMENLKKI 
ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS 
ENVMNNKWII WSYEHMYTRK NPSDLTKEFL 
QGNVIK 



LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI 
FLGEVTNWKE LGGKDQKIVI LNRAAGSGTR 
DTPGAISYTA FSYVTDEVAT LSIDGVQPTD 
DFMLSDDIQE RVIGQLGYIP VSKMEIERDW 



EF037-1 (SEQ ID NO:141) 

TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA 
TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT 
GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAA 

EF037-2 (SEQ ID NO:142) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 

EF037-3 (SEQ ID NO:143) 

AACAGATAG TAGTTCTAGT 

AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA A 



EF037-4 (SEQ ID NO: 144) 
TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI 



GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
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VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 



EF038-1 (SEQ ID NO: 145) 



TAATGGCCAT TTCGTCTACT AATAAAGAGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC 
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA 
GGTGGTAAAT GGGAACTAGA AAATAAAAGT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT 
AATAATAAAA AACTCACTTT GGATATAAAA 
GAATATAAAG ACGGTAAATT AAAAGGTGAA 
TNGAATAAGA GGTGTCTTTG A 



ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
ATGAAAAAAG TACTACCTTT TATTGCCTTA 
ACAGATATGA AAAAGATATT GACTGCCGAT 
CCAACTACTA CTTACACTTT TTTTGATGAT 
AGTGATAGTG GAACGTACTC TTACGATGAA 
AATAAAGAAC AATTAATAAT GGAAAATGTT 
ATTGGAGGCG AGAAGGACTC TGATAAAAAA 



EF038-2 (SEQ ID NO:146) 



M KLLKWRWQWN KDHKKGEVSM KKVLPFIALV GLLLLSGCGT DMKKILTADG 
GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 
YKDGKLKGEI GGEKDSDKKX NKRCL 



EF038-3 (SEQ ID NO:147) 



TTGTGGA ACAGATATGA AAAAGATATT GA< 
GGTGGTAAAT GGGAACTAGA AAATAAAAGT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT 
AATAATAAAA AACTCACTTT GGATATAAAA 
GAATATAAAG ACGGTAAATT AAAAGGTGAA 
TNGAATAAGA GGTGTCTTTG A 



CCAACTACTA CTTACACTTT TTTTGATGAT 
AGTGATAGTG GAACGTACTC TTACGATGAA 
AATAAAGAAC AATTAATAAT GGAAAATGTT 
ATTGGAGGCG AGAAGGACTC TGATAAAAAA 



EF038-4 (SEQ ID NO: 148) 
CGT DMKKILTADG 

GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 
YKDGKLKGEI GGEKDSDKKX NKRCL 



EF039-1 (SEQ ID NO:149) 



TAAATATATC AAAAAGAAAA AAGGGGATTA 
GCGCTTACCT TATTAACCTT TAGTACGTTG 
TCTGCAACAG ATAAATCAAG TGCAGCTAGC 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA 
CAAGGCACAA AAGTTTACGA CAAAAATNAT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC 
TTAATTTCGG ATACACAAAA CTATCAAGCG 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT 
GAACATATCC CTTATTCTGA CGAGAAATTA 
ACAGAAACTG GCAAGATTGT TACCAATGAA 
TGGAATGGCA CGAAAGTTTT AGATGAAGAC 
TTTATTAGTT TAGCGAAATT TGATAATAAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC 
GCTCACGTTT CAATTGGTGA CAATAAATAT 
GATAAACGTT TTACGTATAC ACGAATGGGT 



CCAACCATGA AAAAGAAAAA AGTTTTTAGT 
TTGATTGCAG GCTGTGCTGG CGGAGCCAAC 
TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
TTAACAGAAA TTTTATCCAG TACTGATTGG 
AATAATTTAA CAGCAGAAAA TGCTAATTTT 
GGTTTTTATG AATTTTTCGA CAAAGAAACA 
TTTGTGACAG ACGATGGCGA AAAGCGTATC 
GTGGTCGATT TAACGGAAGT GACGAAAGAT 
AAAGACGGGA AAGATGTAGA AGTCTTTGTA 
ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
CCTGGGGATG ACATTTTAGG GGCCACATTA 
GGTAACGATG TTACTGAAGC AAATAAAATG 
ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
AAAGACAACA ATGGCAAAGA AATTAAAGTC 
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TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 
EF039-2 (SEQ ID NO: 150) 

MKKKKVFSA LTLLTFSTLL IAGCAGGANS ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANFI GLAKYDGETG FYEFFDKETG 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF 
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG AALELTELND 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF039-3 (SEQ ID NO: 151) 

TGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 

GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG 

CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT 

ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA 

GGTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC 

TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT 

AAATTTACCT ATAAGC GAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA 

GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA 

ACAGAAACTG GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA 

TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG 

TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA 

ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT 

GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT 

GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGGCAAAGA AATTAAAGTC 

TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 



EF039-4 (SEQ ID NO:152) 

ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP 
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF 

EF040-1 (SEQ ID NO:153) 



NLTAENANFI GLAKYDGETG FYEFFDKETG 
VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 
GDDILGATLW NGTKVLDEDG NDVTEANKMF 
QVIDNNKIRA HVSIGDNKYG AALELTELND 
TPDFTF 



TAGATTAGAA CCACTGGAGA AAAATCTCAT ATTTCTCTCG AGGAAAGGAA GTTGAGCACA 
ATGAACAAAA AAATTTTAAT GGGGC TATTA AGTGTCGTGA CCATTCCATT ACTTGCTGCG 
TGTCAAGGAG GAGAAACACC TTCCGCAGCG TCAAAAAATA GTCAAACGGT GACTACTCAA 
AGTAGTGCAA AAACTGAAAG CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA 
GAGGAAGTGA AAGAACCGAT GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT 
CAACGAGATA CGAAGAAGGT CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA 
GTCGACACAG AAGGCAACAC ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT 
GCAAAAGCGT TGATTGATCG GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT 
CCCTATCTTT ATGCGGGAGC GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT 
GCGACCCCAG ATTTAAATAA GCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT 
GAAAAAGGAC ATATTGACAA TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT 
TTCCAAAATG ACTTTGGCTA TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC 
CAACTTTACC AAGATATTGT AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA 
GACAATTCTG GTCGAACAGC AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT 
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AAAATTTTAG CACAGTACAA CTAA 
EF040-2 (SEQ ID NO: 154) 

M NKKILMGLLS WTIPLLAAC QGGETPSAAS KNSQTVTTQS 

SAKTESTSTT RSVAQTTSKE EVKEPMKTYE VGALLEAANQ RDTKKVKEIL QDTTYQVDEV 
DTEGNTPLNI AVHNNDIEIA KALIDRGADI NLQNSISDSP YLYAGAQGRT EILAYMLKHA 
TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI EAVGLREGNQ 
LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN 

EF040-3 (SEQ ID NO: 155) 

AGCG TCAAAAAATA GTCAAACGGT GACTACTCAA 

AGTAGTGCAA AAACTGAAAG CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA 
GAGGAAGTGA AAGAACCGAT GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT 
CAACGAGATA CGAAGAAGGT CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA 
GTCGACACAG AAGGCAACAC ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT 
GCAAAAGCGT TGATTGATCG GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT 
CCCTATCTTT ATGCGGGAGC GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT 
GCGACCCCAG ATTTAAATAA GCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT 
GAAAAAGGAC ATATTGACAA TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT 
TTCCAAAATG ACTTTGGCTA TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC 
CAACTTTACC AAGATATTGT AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA 
GACAATTCTG GTCGAACAGC AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT 
AAAATTTTAG CACAGTACAA C 

EF040-4 ( SEQ ID NO: 156) 

AS KNSQTVTTQS 

SAKTESTSTT RSVAQTTSKE EVKEPMKTYE VGALLEAANQ RDTKKVKEIL QDTTYQVDEV 
DTEGNTPLNI AVHNNDIEIA KALIDRGADI NLQNSISDSP YLYAGAQGRT EILAYMLKHA 
TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI EAVGLREGNQ 
LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN 



EF041-1 (SEQ ID NO:157) 

TAATTATTAA NTTCTGATTT TTCAGAAAAT 
ATGAAATTGA AAAAGTCATT AACATTCGGT 
GCGGCTTGTG GAGGCGGCGG AACGTCAGAT 
AGTGGCGAAC AAGTTTTACG TGTCACAGAA 
CTAGCAACAG NCAGAATTAG TTTTATTGCA 
TTAGACAAAG ATAACAAAGT CCAACCTGCA 
GATGGACTAA CATACAAAAT TAAATTAAAT 
GTGACTGCTA ATGACTATGT TTACGGATGG 
GAATATGCTT ATCTGTATGC CTCTGTAAAA 
GATAAATCAG AATTAGGAAT TAAAGCAGTC 
AAAGCAACAC CATACTTTGA TTACTTATTA 
GACATTGTGG AAAAATATGG TAAAAATTAT 
GGTCCATTCG TCTTAGACGG CTTTGATGGT 
AAAAACGATC AATATTGGGA TAAAGATACT 
GTGAAAGAAT CACCAACCGC GTTGAACTTG 
CTTTCTGGTG AATTAGCCCA ACAAATGGCC 
GCATCAACAC AATATATGGA ACTAAATCAA 



ACAGATTGCA TTATTTTAGG AGGCAACACT 
GTGATTACAT TATTTAGCGT AACAACTTTA 
AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 
CAACAAGAAA TGCCAACAGC TGATTTATCA 
TTAAATAATG TATATGAAGG AATTTATCGT 
GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA 
AAAGATGCAA AATGGTCAGA CGGTAAACCA 
CAACGAACAG TTGATCCAGC GACAGCTTCT 
AATGGTGATG CCATTGCTAA AGGGGAAAAA 
AGTGATACAG AATTAGAAAT CACTTTAGAA 
GCTTTCCCAT CATTCTTCCC GCAACGTCAA 
GCATCAAACA GCGAAAGTGC TGTCTACAAT 
CCTGGTACAG ATACAAAATG GTCATTCAAG 
GTGAAACTGG ACTCAGTAGA TGTGAATGTC 
TTCCAAGATG GACAAACAGA CGATGTCGTT 
AATGACCCAG CTTTTGTTAG TCAAAAAGAA 
CGTGATGAAA AATCACCATT TAGAAATGCG 
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AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT 
AGGGGATGG 



EF041-2 (SEQ ID NO: 158) 

M KLKKSLTFGV ITLFSVTTLA ACGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNW KESPTALNLF QDGQTDDWL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 



EF041-3 (SEQ ID NO:159) 

TTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 
AGTGGCGAAC AAGTTTTACG TGTCACAGAA CAACAAGAAA TGCCAACAGC TGATTTATCA 
CTAGCAACAG NCAGAATTAG TTTTATTGCA TTAAATAATG TATATGAAGG AATTTATCGT 
TTAGACAAAG ATAACAAAGT CCAACCTGCA GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA 
GATGGACTAA CATACAAAAT TAAATTAAAT AAAGATGCAA AATGGTCAGA CGGTAAACCA 
GTGACTGCTA ATGACTATGT TTACGGATGG CAACGAACAG TTGATCCAGC GACAGCTTCT 
GAATATGCTT ATCTGTATGC CTCTGTAAAA AATGGTGATG CCATTGCTAA AGGGGAAAAA 
GATAAATCAG AATTAGGAAT TAAAGCAGTC AGTGATACAG AATTAGAAAT CACTTTAGAA 
AAAGCAACAC CATACTTTGA TTACTTATTA GCTTTCCCAT CATTCTTCCC GCAACGTCAA 
GACATTGTGG AAAAATATGG TAAAAATTAT GCATCAAACA GCGAAAGTGC TGTCTACAAT 
GGTCCATTCG TCTTAGACGG CTTTGATGGT CCTGGTACAG ATACAAAATG GTCATTCAAG 
AAAAACGATC AATATTGGGA TAAAGATACT GTGAAACTGG ACTCAGTAGA TGTGAATGTC 
GTGAAAGAAT CACCAACCGC GTTGAACTTG TTCCAAGATG GACAAACAGA CGATGTCGTT 
CTTTCTGGTG AATTAGCCCA ACAAATGGCC AATGACCCAG CTTTTGTTAG TCAAAAAGAA 
GCATCAACAC AATATATGGA ACTAAATCAA CGTGATGAAA AATCACCATT TAGAAATGCG 
AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT 
AGGGGATGG 

EF041-4 (SEQ ID NO:160) 
CGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNW KESPTALNLF QDGQTDDWL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 

EF044-1 (SEQ ID NO:161) 

TAAGATAAAA TTAGTTATAG CGTCTATAGG AGGAATAGTA TGAAAAAATT AGTTTGTGTT 
ATTTTAGTTA TTTTTTTAAC AGGTTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 
GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
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GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 
GTAATTCAAT AA 

EF044-2 (SEQ ID NO: 162) 

MKKLVCVI LVIFLTGCSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI 

ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG 

MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS 

IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 

IQ 

EF044-3 (SEQ ID NO:163) 

TTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 

GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 
GTAATTCAA 



EF044-4 (SEQ ID NO:164) 

CSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI 
ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA 
MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE 
IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL 
IQ 

EF045-1 (SEQ ID NO:165) 

TAGCCAAAAA ATGAGGGAGG AAAAGAGATG AACAAGAAAC GGATTTTAGG TGCAATCACG 
TTAGCTTCTG TGTTAGTATT CGGGTTAGCT GCATGTGGTG GCGGCAATAA AGGCGGGGGC 
AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 



TPYAVDLSSL NNPLVFNFKG 
LSINTIPTKE IRIFSAADNS 
ITPNYAGNVT DDQKDVMLEV 
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GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTGAGTCAG AAGAAAATAC TAAATTACTT GATGCGATTG ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAATAA 

EF045-2 {SEQ ID NO:166) 

MN KKRILGAITL ASVLVFGLAA CGGGNKGGGN KATETEDISK MPIAVKNDKK 
AIDGGTLDVA WMDTQFQGL FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 
LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 
DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
RKNPVTIGPY YMSNIVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 
AMGYAIDNDA VGQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD 
VDGDGIREDK EGKPLEIKFA SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
EKRKKAFYDW QEYAIDEAFV I PTLYRNEVL PVNDRWDFT WAVDTKDNPW ATVGVTADSR 
K 

EF045-3 (SEQ ID NO: 167) 
ATGTGGTG GCGGCAATAA AGGCGGGGGC 

AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 
GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
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GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTGAGTCAG AAGAAAATAC TAAATTACTT GATGCGATTG ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAA 



EF045-4 (SEQ ID NO: 168) 
CGGGNKGGGN KATETEDISK MPIAVKNDKK 

AIDGGTLDVA WMDTQFQGL FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 

LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 

DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 

RKNPVTIGPY YMSNIVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 

ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 

AMGYAIDNDA VGQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD 

VDGDGIREDK EGKPLEIKFA SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 

DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE 

EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL PVNDRWDFT WAVDTKDNPW ATVGVTADSR 
K 



EF046-1 (SEQ ID NO: 169) 

TAGGAGGATA TAATGAAAAA AAAACTTATT 
TGTAGTAATA ATACTGGGGG AAAAAATAGC 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT 
ACATCATCTA TAACAATTGA AACAACCGAG 
GATGATGTTT CAAAAACTAG ACGACAATTG 
ACGGATAAAG AACTAAAGGA ATATATATCA 
AATTATATTA AGCAAAAA 



GTACTATTGT TAGCCTTATT TTTAACGGCA 
GACGCTTCAT CTACTGAAGT ATCAACTAAG 
AGTAGTAATC CGGACACAAC ACCAACTTCT 
AATTTAAAGA ATAGAGAATT GAATCCAACA 
TATGAACAAG GAATTAACAG TTCAACAATT 
GAGGCTAAAG AACAAAAGAA AGATGTCATT 



EF046-2 (SEQ ID NO:170) 

MKKKLIV LLLALFLTAC SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 
SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 

EF046-3 (SEQ ID NO:171) 
A 

TGTAGTAATA ATACTGGGGG AAAAAATAGC GACGCTTCAT CTACTGAAGT ATCAACTAAG 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT AGTAGTAATC CGGACACAAC ACCAACTTCT 
ACATCATCTA TAACAATTGA AACAACCGAG AATTTAAAGA ATAGAGAATT GAATCCAACA 
GATGATGTTT CAAAAACTAG ACGACAATTG TATGAACAAG GAATTAACAG TTCAACAATT 
ACGGATAAAG AACTAAAGGA ATATATATCA GAGGCTAAAG AACAAAAGAA AGATGTCATT 
AATTATATTA AGCAAAAA 
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EF046-4 (SEQ ID NO:172) 

C SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 

SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 

EF047-1 (SEQ ID NO: 173) 

TAGGGAAAAC AAGGAGGAAT TCTTATGAAA AAGATAGGGC TTATTTCTAG TGCTTTTCTT 

TTAACCCTTG CTTTAGCAGC ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 

TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 

AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 

TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 

TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 

AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 

TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA 

GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 

TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 

GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 

GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 

AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 

GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 

GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 

EF047-2 (SEQ ID NO: 174) 

MKK IGLISSAFLL TLALAACGGG KSTENTDSRS SAAESTTVES TKASATKESS 
SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR VNQLTPISSF 
EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLWRASN 
INGESPDDLA KNWNILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSWWQA GTWYSVHHF 
DPIQAVKMAT SM 

EF047-3 (SEQ ID NO: 175) 

ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 

TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 



EF047-4 (SEQ ID NO:176) 
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CGGG KSTENTDSRS SAAESTTVES TKASATKESS 

SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR VNQLTPISSF 
EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLWRASN 
INGESPDDLA KNWNILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSWWQA GTWYSVHHF 
DPIQAVKMAT SM 



EF048-1 (SEQ ID NO:177) 

TAAGGAGAAA AGTTCATGAA AAAAAGAAAG GTTTTATTTA CAGCAGTTAT GGTATTGGCA 
GGATTACAGT TGCTAAGTGG TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC 
TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-2 (SEQ ID NO: 178) 

MKKRKV LFTAVMVLAG LQLLSGCGKT EASANDTWL RYAYASNSQP VIDSMKKFGE 
LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 

EF048-3 (SEQ ID NO: 179) 

TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC 

TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA. ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
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AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-4 (SEQ ID NO:180) 

CGKT EASANDTWL RYAYASNSQP VIDSMKKFGE 

LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 



EF049-1 (SEQ ID NO: 181) 

TGAGACTCTT TCTTTTTCAA AATGAGGTAT GGTATAGTTA TAACAGANAT AAAACTANAA 
AAAACAGGAG TGCATAAGAG AATGAAGAAA AAACTAATCT TAGCTGCAGC GGGCGCAATG 
GCCGTTTTTA GTTTAGCAGC GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 
TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 

EF049-2 (SEQ ID NO:182) 

MKKK LILAAAGAMA VFSLAACSSG SKDIATMKGS 

TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF EEKYGDKVTD KXIQKNFDEA KEQVEAQGGK 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK AHLKITDEDL KTAWASFHPE VEAQIIQVAS 
EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD SQATTVPAEV KEAAFKLKDG 
EVSEPIAATN MQTYQTTYYV VKMTKNKAKG NDMKPYEKEI KKIAEETKLA DQTFVSKVIS 
DELKAANVKI KDDAFKNALA GYMQTESSSA SSEKKESKSS DSKTSDTKTS DSEKATDSSS 
KTTESSSK 

EF049-3 (SEQ ID NO: 183) 

GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 

TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
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AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 



EF049-4 (SEQ ID NO: 184) 
CSSG SKDIATMKGS 

TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK 
EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT 
EVSEPIAATN MQTYQTTYYV VKMTKNKAKG 
DELKAANVKI KDDAFKNALA GYMQTESSSA 
KTTESSSK 



EEKYGDKVTD KXIQKNFDEA KEQVEAQGGK 
AHLKITDEDL KTAWASFHPE VEAQIIQVAS 
KKDGGKIKFD SQATTVPAEV KEAAFKLKDG 
NDMKPYEKEI KKIAEETKLA DQTFVSKVIS 
SSEKKESKSS DSKTSDTKTS DSEKATDSSS 



EF050-1 {SEQ ID NO: 185) 

TAGGGTCTGG AAAAGCAGTC AACTGACTTC TTTTCCAAGC CCTTTTTTAG TTCATCGCAG 
AAAGGATGNA AAAAAATGAA CATGCCCAAA AATATCNGTT ATTTTTCTTT GCTAATGGGT 
CTTGTTCTAT TATTAAGTGC TTGCCAAATT GGGGC AACTA CGAAGGATGA CAACCAAGCC 
GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTCAT 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTGGT TAAAGAGCGA 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA AGATAATAAA 
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT 
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGG AACTA 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT 
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA 
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC CGAGTAAAAA ATAG 

EF050-2 (SEQ ID NO:186) 

MNMPKN IXYFSLLMGL VLLLSACQIG ATTKDDNQAA 

TKEATVELNR TTTPTLFFHG YAGTKNSFGS LLHRLEKQGA TTQELVLLVK PDGTWKERG 
ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY QVNKANIVGH SMGGVSGLRY 
LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE NGPTEKSSRY LDYQEMINW 
PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNALLRQRGT QVTSQIIKGE NAQHSQLHEN 
PEVDQLLIEF LWPSKK 



EF050-3 (SEQ ID NO: 187) 
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TTGCCAAATT GGGGCAACTA CGAAGGATGA CAACCAAGCC 

GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTCAT 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTGGT TAAAGAGCGA 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA AGATAATAAA 
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT 
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGGAACTA 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT 
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA 
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC CGAGTAAAAA ATAG 

EF050-4 (SEQ ID NO:188) 

CQIG ATTKDDNQAA 

TKEATVELNR TTTPTLFFHG YAGTKNSFGS LLHRLEKQGA TTQELVLLVK PDGTWKERG 
ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY QVNKANIVGH SMGGVSGLRY 
LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE NGPTEKSSRY LDYQEMINW 
PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNALLRQRGT QVTSQIIKGE NAQHSQLHEN 
PEVDQLLIEF LWPSKK 



EF051-1 (SEQ ID NO:189) 

TAAAAGAAAA GAGGCGTTCA AATGTCTAAA CAAAAAAAGG CTGTGTTCCT GCTTAGTTTA 
TTCAGTTTAG TTGCCCTAAT TGCTGCATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 
ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-2 (SEQ ID NO: 190) 

MSKQ KKAVFLLSLF SLVALIAACT NQPQKETVST KKEEITLAAA ASLESVMEKK 
IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSW 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 

EF051-3 (SEQ ID NO:191) 

ATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 

ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
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AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAAC TGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-4 (SEQ ID NO: 192) 

CT NQPQKETVST KKEEITLAAA ASLESVMEKK 

IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSW 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 



EF052-1 (SEQ ID NO: 193) 

TAAAGTAGGA GAAGCGCAAG CGAAAAAAGT 
CCCACAATGG GTACCATGGG TAGCATTATC 
TTACTTAGTT CGTCGTGGAG AGAAGTGGAA 
NGAAATCTTC NGTTTTTATT ATTGTTGGTT 
GCAGAAAATA GGGAGACCAC AGAAGTCGGA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG 
GTTCGTAGCA GAACGCAAAT AAAAAGATTA 
CTAAGCTGGT TTGGCATATT GTTTTTAATA 
TTATGTAGAA AAGGAGAATA A 

EF052-2 (SEQ ID NO: 194) 



GAATCAATCG GCAGCGTATC AAGTAGTGAT 
TTTGACAGTA GCACTTGCTG GATTGATTGC 
AAACGAAGGG GAAGTGACAT AATGAGANGA 
CTATTAATTT ATATTCCTCA AACAACTTAT 
ATCGGGTTTA CAAAAACTTC AGACATACCA 
CCGCAAACAA CCATTCAATC GCTATCAATC 
CCTAAAACTG GTGACAATCG AATAACTTGG 
AGTAGTTTTT GGCTGTTTCT ATTTAGACAA 



MRXX 

NLXFLLLLVL LIYIPQTTYA ENRETTEVGI GFTKTSDIPS KKNPWNVLP QTTIQSLSIV 
RSRTQIKRLP KTGDNRITWL SWFGILFLIS SFWLFLFRQL CRKGE 

EF052-3 (SEQ ID NO:195) 

AGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGGTTTA CAAAAACTTC AGACATACCA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA CCATTCAATC GCTATCAATC 
GTTCGTAGCA GAACGCAAAT AAAAAGAT 

EF052-4 (SEQ ID NO:196) 

ENRETTEVGI GFTKTSDIPS KKNPWNVLP QTTIQSLSIV 
RSRTQIKR 



EF053-1 (SEQ ID NO.-197) 

TAGTCATGGC ACCATAACAA GGAGGAGAGA AGTGAGATGA AAAAATACCT TTTGCTTAGT 
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TGTTTTTTAG GTCTTTTCAG CTTCTGTCAT TCAGACACTG CGTTTGGAGA AGCAGCTTAT 
GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGATTACC AGCGACAGGT 
ACCACCAATC AAGCACCATT TATTTATTTG GGAATCAGCC TTATCACTAT AGGCATATTA 
TTTATTAAAA GGAGAAGAGA AGATGAAAAA AACAGTATTA GCAGTAGTAG GGATTGTAGG 
ATTTAG 

EF053-2 (SEQ ID NO: 198) 

MKKYLLLSC FLGLFSFCHS DTAFGEAAYE NSGWSFYGT YEYPTEESTT 

ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGRLPATGT TNQAPFIYLG ISLITIGILF 

IKRRREDEKN SISSSRDCRI 

EF053-3 (SEQ ID NO: 199) 

TTTGGAGA AGCAGCTTAT 

GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGA 

EF053-4 (SEQ ID NO:200) 

FGEAAYE NSGWSFYGT YEYPTEESTT 
ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGR 



EF054-1 (SEQ ID NO:201) 

TAAATAAAAA ATTATTTGGA GGAAATTACA ATGAAAAAAA TTATTTTATC AAGCTTGTTT 

AGTGCAGTAC TAGTATTCGG TGGCGGAAGT ATAACAGCAT TCGCTGACGA TTTAGGACCA 

ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT CTAGTGAACC TACGAATCCT 

ACTGAGCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC 

AAGCCAACCG AGCCTACAAC ACCGAGTGAG CCAGAAAAGC CAACAGAACC AACAACGCCA 

ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 

CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAACCTG TTACTCCAGA ACAACCGAAA 

GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 

GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA GTGGAGAAAT TGATAAAACG 

AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA 

ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 

GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 

TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA 

TTGCCACATA CAGGAGAGAA ATTCACACTC CTTTTCTCTG TATTGGGAAG CTTCTTTGTA 

TTAATTTCAG GATTCTTTTT CTTTAAAAAG AATAAGAAAA AAGCTTAA 

EF054-2 (SEQ ID NO:202) 

M KKIILSSLFS AVLVFGGGSI TAFADDLGPT DPATPPITEP TDSSEPTNPT 
EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVW TPSGEIDKTN 
QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKVL PHTGEKFTLL FSVLGSFFVL I SGFFFFKKN KKKA 



EF054-3 (SEQ ID NO:203) 
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A 

ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT CTAGTGAACC TACGAATCCT 
ACTGAGCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC 
AAGCCAACCG AGCCTACAAC ACCGAGTGAG CCAGAAAAGC CAACAGAACC AACAACGCCA 
ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAACCTG TTACTCCAGA ACAACCGAAA 
GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA GTGGAGAAAT TGATAAAACG 
AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 
TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA 
T 

EF054-4 (SEQ ID NO: 204) 
DDLGPT DPATPPITEP TDSSEPTNPT 

EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDWV TPSGEIDKTN 
QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEE I VAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKV 



EF055-1 (SEQ ID NO:205) 

TAACAAAAGG TTGTTTTGTC TTTCTTGTGT AAAAGGGCAA GAAAGGCTAG CGAGTTAAAA 

GGAGGTTTTT CAATGAAAAA AAAGCGTTAT TTAATGATTG TGTGTCTACT ATCTTCTCCT 

AGTTTTTTTA TAAATGTTGA AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG GATTGAATTT 

TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT 

GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA 
CAGACCACCA CAACTGGCTC AACGCTCCCT CGTACAGGGA GCAAGAGTCA GGCAAATTTG 

AGCATTCTCN GNTTCGCCTT AATCGGTTTG GCGGGAATCG TACATAGAAA GAAGGGACGA 
CATGAAGCAA ACTAA 

EF055-2 (SEQ ID NO:206) 

MKKKRYL MIVCLLSSPS FFINVEASDG GSSSVGIEFY 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGSTLPR TGSKSQANLS 
ILXFALIGLA GIVHRKKGRH EAN 

EF055-3 (SEQ ID NO:207) 

AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG GATTGAATTT 

TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT 
GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA 
CAGACCACCA CAACTGGCTC AACG 



EF055-4 (SEQ ID NO: 208) 
SDG GSSSVGIEFY 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGST 
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EF056-1 (SEQ ID NO:209) 

TAAATGAAAA AAAAGCGTTA TTTAATAATT GCGTGTTTAC TATTTTCCCC TAGTTTTTTT 
ATAAATGTTG AAGCATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 
CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAGCTCCC TCGTACAGGA AGCAAGAGTC AGGCAAACCT GAGCATTCTT 
GGTCTTGTCT TGATTGGTCT TGTCGGAATG GTCCAGAGAA AGAAGGGACG ACATGAAGCA 
AACTAA 



EF056-2 (SEQ ID NO:210) 

MKKKRYLIIA CLLFSPSFFI NVEASEGGSS 
EPAGPLQGDQ RSGGSTQTTT AGSQLPRTGS 

EF056-3 (SEQ ID NO:211) 



SVG I EFYQNP ATPAPKDAPP KTDEPAADPK 
KSQANLSILG LVLIGLVGMV QRKKGRHEAN 



ATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 

CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAG 

EF056-4 (SEQ ID NO:212) 

SEGGSS SVG I EFYQNP ATPAPKDAPP KTDEPAADPK 
EPAGPLQGDQ RSGGSTQTTT AGSQ 

EF057-1 (SEQ ID NO:213) 

TAATGTTTAT TGGCTGGGCC AGTCAATGTT GAAAATGGGG AAGGAGGAAT TCAGATGAAA 
ATCATAAAAA GGTTTAGTTT GGTATGTTTA GGGCTATTGA TCATTGGGTT GCNAACAAAA 
AGCGNTATGG CTGAAGAAAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 
TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ' ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT TACCTAAAAC AGGCGAGTCT 
GAAAATCCGC TGTATTCCTT GATAGGAGTT AGTTTGTTGG GGATAGTCAT TTATTTAATT 
AATAAAATGA AACGAGAGAA GGAGTTTATT TAA 

EF057-2 (SEQ ID NO:214) 

MKI IKRFSLVCLG LLIIGLXTKS XMAEENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AVLPKTGESE NPLYSLIGVS LLGIVIYLIN 
KMKREKEFI 

EF057-3 (SEQ ID NO:215) 

AAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 

TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT 

EF057-4 (SEQ ID NO:216) 

EENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AV 
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EF058-1 (SEQ ID NO:217) 

TGAAGAACGT TCTATTTGGT TGACGATTGC AGGCCTGCTA ATCATTGGGA TGGTAGTCAT 
TTGGCTATTT TATCAAAAAC AAAAAAGAGG AGAGAGAAAA TGAAGCAATT AAAAAAAGTT 
TGGTACACCG TTAGTACCTT GTTACTAATT TTGCCACTTT TCACAAGTGT ATTAGGGACA 
ACAACTGCAT TTGCAGAAGA AAATGGGGAG AGCGCACAGC TCGTGATTCA CAAAAAGAAA 
ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGC TAAAC AAGCTGTCCA AAGTTTAACT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA 
GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC 
AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC 
TTTAAAAATG AGGCGAATGT TGATAACGGT CATAC CGACG ACCAAACACC ACCAACTGTT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC TTCAACAGGT GGCAAAGGAA TCTACGTTTA CTTAGGAAGT 
GGCGCAGTCT TGCTACTTAT TGCAGGAGTC TACTTTGCTA GACGTAGAAA AGAAAATGCT 
TAA 

EF058-2 (SEQ ID NO:218) 
MKQLKKVW YTVSTLLLIL PLFTSVLGTT 

TAFAEENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS IYNVTNEFYE 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMWAF PVYEMIKQTD GSYKYGTEEL AWHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
KNEANVDNGH TDDQTPPTVE WTGGKRFIK VDGDVTATQA LAGASFWRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL EETVAPDDYV LLTNRIEFW 
NEQSYGTTEN LVSPEKVPNK HKGTLPSTGG KGIYVYLGSG AVLLLIAGVY FARRRKENA 



EF058-3 (SEQ ID NO:219) 
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AGAAGA AAATGGGGAG AGCGCACAGC TCGTGATTCA CAAAAAGAAA 

ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGCTAAAC AAGCTGTCCA AAGTTTAACT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA 
GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC 
AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC 
TTTAAAAATG AGGCGAATGT TGATAACGGT CATACCGACG ACCAAACACC ACCAACTGTT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC T 

EF058-4 (SEQ ID NO:220) 

EENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS IYNVTNEFYE 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMWAF PVYEMIKQTD GSYKYGTEEL AWHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
KNEANVDNGH TDDQTPPTVE WTGGKRFIK VDGDVTATQA LAGASFWRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL EETVAPDDYV LLTNRIEFW 
NEQSYGTTEN LVSPEKVPNK HKGT 



EF059-1 (SEQ ID NO:221) 

TAGATTGGAA GAATGAAAAT GAAAAAAATG ATTATTATTG CCTTATTCAG TACAAGCCTT 
TTAGCAGGGG GAAGCAGTGT TTCTGCTTAT GCGCAAGAAT CAGAAGGAAA TCTTGGTGAA 
ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA CTGACCCAAT AACGCCAAGT 
GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA 
ACCGAACCTA GTGAGCCTTC AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA 
CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
GTACCAGAGC AACCAACAGA GCCAAGTGTA CCAGAAAAAC CAGTAGAACC AAATAAACCA 
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ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTACTTCC TTACACTGGT 
GAAAAAATGG GCATAATTGG GTCAATCGCT GGTGTATGTT TGACTGTTTT ATCAGGAATC 
TTAATTTATA AAAAACGTAA AGTGTAG 

EF059-2 (SEQ ID NO:222) 

MKKMI IIALFSTSLL AGGSSVSAYA QESEGNLGET TGSVLPDEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTPSKPE QPTEPTTPSV 
PEQPTEPSVP EKPVEPNKPT EPEKPVPWP EKPWPQQPE QPTDWVKPN GEIATGESTQ 
QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP LTQTADGLKP IKSEYKVLPS 
GNVQVKSADG KMKVLPYTGE KMGIIGSIAG VCLTVLSGIL IYKKRKV 

EF059-3 (SEQ ID NO:223) 

AGAAGGAAA TCTTGGTGAA 

ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA CTGACCCAAT AACGCCAAGT 
GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA 
ACCGAACCTA GTGAGCC TTC AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA 
CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
GTACCAGAGC AACCAACAGA GCCAAGTGTA CCAGAAAAAC CAGTAGAACC AAATAAACCA 
ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTAC 



EF059-4 (SEQ ID NO:224) 

EGNLGET TGSVLPDEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD 
PEQPTEPSVP EKPVEPNKPT EPEKPVPWP 
QPTVPIETNN LSEVTHVPTV TTPIETASGE 
GNVQVKSADG KMKV 

EF060-1 (SEQ ID NO:225) 



PSLPDEPSVP TEPTTPSKPE QPTEPTTPSV 
EKPWPQQPE QPTDWVKPN GEIATGESTQ 
AIVAVDKGVP LTQTADGLKP IKSEYKVLPS 



TGAAAAATAG ACAAGGAGCA CGCGATGATG ACAATGAAAA GTAAAGGGTC ACTTCTGGTG 
ACGTTGGGAA TACTTTTAAC CGTTGGCATT GCGAGTCTAA TTGTTTCTTC TGAGAGTTTT 
GCAGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT TACCAAGGAC AGGTGAACGA AATAGCACGT GGCTTTACAG CCTTGGTATT 
GCCTGTTTAC TCGTAGTACT AACAAGTTTC TATTATTTGA ATAAAAAAAG GAAAAAGGAA 
AAATAA 

EF060-2 (SEQ ID NO:226) 

MMT MKSKGSLLVT LGILLTVGIA SLIVSSESFA EEVGQTNIGV TFYGGKEPLK 
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TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TSLPRTGERN STWLYSLGIA 
CLLWLTSFY YLNKKRKKEK 

EF060-3 (SEQ ID NO:227) 

AGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT 

EF060-4 (SEQ ID NO:228> 
EEVGQTNIGV TFYGGKEPLK 

TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TS 
EF061-1 (SEQ ID NO:229) 



TAATGGAACG 
ATAATGATGA 
AGTGAAATTT 
GAAGTACCAA 
CC AC CTGTAG 
CCGACAACAC 
GAGCCAAGTA 
GAAAAAACTG 
CCAAGCAAGC 
GGTACACAAC 
CCTAGTGTAA 
GGTGTTCCAC 
TTGCCTAGCG 
ACAGGTGAAG 



ACCGACAGAA 
AAAAAATTCT 
CTGCTTTTGC 
CAGAACCAAG 
ACC CTGTAG A 
CAACAGAACC 
AACCAGTAGA 
TGACACCAAC 
CAATCGACGT 
AGCCAACAGT 
CAACACCTAT 
TTACACAAAC 
GAAATGTAGA 
AAATGAATAT 



GAAGATTTTG 
TTTTGCTAGT 
ACAAGAAATT 
TACACCAGAA 
GCCACCTATT 
TACAACTCCT 
ACCTGAAAAA 
TAAACCAACA 
TGTTGTAACG 
CCCTATTGAA 
TACAACTACA 
AGCAGAAGGG 
AGTAAAAGGT 
CTTTTTATCT 



AACTTACAAA 
TTATTTAGTG 
ATCCCTGATG 
AAGCCAACAG 
ACACCAACGG 
ACAGAGCCAA 
CCAGTTACAC 
GAATCTGAAA 
CCAACAGGGG 
ACAAGTAATT 
GACGGAGAAA 
TTAAAAC CTA 
AAGGACGGTA 
GCCGTAGCGG 



TTAAAATTAA 
CCACACTACT 
ATACTACGAC 
ATCCAACACC 
AGCCAACAGA 
GTGAACCAGA 
CAAGCAAACC 
AACCAGTACA 
AATTAAATCA 
TGGCAGAAAT 
ACATTGTAGC 
TTCAATCNAG 
AAATGAAGGT 
TATCTTGTCT 



AATGGAGGAA 
ATTTGGGGGA 
ACCGCCCATT 
GCCAATTGAG 
ACCGACAGAG 
ACAACCAACG 
AGCAGAACCC 
ACCAGCAGAA 
CGCTGGAAAT 
CACGCACGTG 
TGTAGAAAAA 
TTACAAAGTA 
TTTACCATAC 
GTAG 



EPSEPEQPTE 
TGELNHAGNG 
KPIQSSYKVL 



EF061-2 (SEQ ID NO;230) 

MMKKILFASL FSATLLFGGS EISAFAQEII PDDTTTPPIE 
VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPTEPTTPT 
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEKPVQPAEP SKPIDVWTP 
TQQPTVPIET SNLAEITHVP SVTTPITTTD GENIVAVEKG VPLTQTAEGL 
PSGNVEVKGK DGKMKVLPYT GEEMNIFLSA VAVSCL 

EF061-3 (SEQ ID NO:231) 



GAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTACA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TT 
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EF061-4 (SEQ ID NO:232) 



QEII PDDTTTPPIE 
VPTEPSTPEK PTDPTPPIEP 
PSKPVEPEKP VTPSKPAEPE 
TQQPTVPIET SNLAEITHVP 
PSGNVEVKGK DGKMKV 



PVDPVEPPIT PTEPTEPTEP 
KTVTPTKPTE SEKPVQPAEP 
SVTTPITTTD GENIVAVEKG 



TTPTEPTTPT EPSEPEQPTE 
SKPIDVWTP TGELNHAGNG 
VPLTQTAEGL KPIQSSYKVL 



EF062-1 (SEQ ID NO:233) 



TGATTCTTGA AGCAACAAAT GAAAGCAAAA 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA 
GATAATGTAC AAGCCGCGGA ATTAGATACG 
AACCCCGACC TGCAGTCAGA AAAGGAAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT 
GGAGCTGAAA AATCAGCACA AGAACAACCA 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT 
AACATTACCG TTGTTGAAAA ACCAGCAGAA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN 
AAGAACGAAA ACAGCTATGT CAATGAAGCG 
GTCGTGACGA AAGACACTAA AATTTCGTCG 
GATTTTAATA AAGTAAATGC AGGGGATTCA 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA 
GGACTAAACG CTAGTTATTT AGGACGTAAA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG 
GATTTTGGGG CNAACAATGC GTTCAAATAC 
GATGGAAAAT TTTACTCACC GGAAGATATT 
AATAGTGATT GGGACGCTGT AGGTCACAAG 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN 
ATTTTCAATT ATGGGAATCC AAAAGAACCA 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN 
NTCAATGATT TAAATGTGAA NCGTGGCGAT 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA 
GATGCAGAAA AAGTGACGAT TGATTTATCC 
CTNAACGANA AAG AC TNAAA AGCTGTTGCT 
GTGACTGCTT CTTATGANCT CAATTTAGAT 
AACGCNGACG GNTCNGTTGT TTTAGCAATG 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA 
GAAACGGTAA CAAATACAGT GATTAACCAT 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT 
CAAACAAAAA TTTATTATGA AGTGAAATCT 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG 
TGGCACGCTA TTACNAANTA TGACCTTAAA 
GATATTTCTG CCTACATTCT TTTAGAAAAC 
AATCAAGCAT TATTGGCNGC NTTAAATGAA 



AAACAATATA AGACATATAA AGCTAAGAAT 
AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
CAACCAGAAA CAACGACGGT TCAACCCAAT 
CCTAAAACGG CAGTATCTGA AGAAGCAACA 
AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GATACCACAA ACGCGCAACA ACCAACAGTA 
GTAGTAAGCC CTGAAACAAC CAATGAACCT 
GAAAATGAAG TGAATAAATC AACGTCCATT 
AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GANAAAGAAG TCGCNGAATA CAACAAGCAT 
ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
AAAGATATCT TTACAAAATT ACGGAAAGAT 
AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
AAAAATAAAC CAGTGACAGT GACCTATACA 
ATTACAAAAG CAGAATTTGT TTATGAACTA 
AATGCAGTAT TTTCAAACGA TCCGATTATC 
GGTAAGGATG TTAAAACACG CTTAACGATT 
CTACCAGATA AAGATAGTCC ATTTGCGTAT 
AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
ATGACAACAA AAGGAAAAAG TAATGTGCCT 
ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
GAAAAAGCAA CGATTGAATT CAATNGATAC 
AATAAAGAAG TCACTGATGG NCAGAAAAAT 
TCTTTACAAT ACATTGTGAC AGGGGATACG 
GTAACNAAAC AAGGGATTCG AGATACNTTT 
AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
CAAAACACCG TCACAGCAAT GATGAAAACC 
GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
AATACAGCTG TTCAGCTGAC AAANGATGGN 
GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
TCCGAACGTC CAGCNAACTA TGGCGGAATN 
GACACGACCC ATGATCGTTT CACAGGNAAA 
GTAGGGGANA AAACGTTAAA AGCAGGAACA 
AAAGACAATA AAGACTTGAC GTTTACNATG 
GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
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TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF062-2 {SEQ ID NO:234) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF062-3 (SEQ ID NO:235) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
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GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTA CTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCC TTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
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TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TG 



EF062-4 (SEQ ID NO:236) 
AELDTQ PETTTVQPNN 

PDLQSEKETP KTAVSEEATV QKDTTSQPTK 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN 
SSAQWFAFXT NLNAQSVKP I FNYGNPKEPE 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD 
ISAYILLENK DNKDLTFTMN QALLAALNEG 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG 
GVERIAAGDV YNTIEESFNN EKIKTNTWT 
WEKASV 

EF063-1 (SEQ ID NO:237) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA 
GATAATGTAC AAGCCGCGGA ATTAGATACG 
AACCCCGACC TGCAGTCAGA AAAGGAAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT 
GGAGCTGAAA AATCAGCACA AGAACAACCA 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT 
AACATTACCG TTGTTGAAAA ACCAGCAGAA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN 
AAGAACGAAA ACAGCTATGT CAATGAAGCG 
GTCGTGACGA AAGACACTAA AATTTCGTCG 
GATTTTAATA AAGTAAATGC AGGGGATTCA 



VEEVAPENKG TEQSSATPND TTNAQQPTVG 
NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
WKITASQAF XDAMNLKENK NVAHSWKAFI 
HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 



AAACAATATA AGACATATAA AGCTAAGAAT 
AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
CAACCAGAAA CAACGACGGT TCAACCCAAT 
CCTAAAACGG CAGTATCTGA AGAAGCAACA 
AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GATACCACAA ACGCGCAACA ACCAACAGTA 
GTAGTAAGCC CTGAAACAAC CAATGAACCT 
GAAAATGAAG TGAATAAATC AACGTCCATT 
AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GANAAAGAAG TCGCNGAATA CAACAAGCAT 
ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
AAAGATATCT TTACAAAATT ACGGAAAGAT 
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ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF0 63-2 (SEQ ID NO: 23 8) 
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MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF063-3 (SEQ ID NO:239) 

GGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 

AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GG AC TAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
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AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 

NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 

ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 

GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTG 



EF063-4 (SEQ ID NO:240) 
ELDTQ PETTTVQPNN 

PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK V 

EF064-1 (SEQ ID NO:241) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGGTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
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GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCC^TTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF064-2 (SEQ ID NO:242) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
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FDTVDLATGV SFFDDYDETX VTPIKDLLRV 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG 
GVERIAAGDV YNTIEESFNN EKIKTNTWT 
WEKASWPE LPQTGEKQMV LLTVAGSLAA 



KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
WKITASQAF XDAMNLKENK NVAHSWKAFI 
HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
MLGLAGLGFK RRKETK 



EF064-3 (SEQ ID NO:243) 



AGTGACGAT TGATTTATCC AAAGTGAAAG : 
CTNAACGANA AAG AC TNAAA AGCTGTTGCT 
GTGACTGCTT CTTATGANCT CAATTTAGAT 
AACGCNGACG GNTCNGTTGT TTTAGCAATG 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA 
GAAACGGTAA CAAATACAGT GATTAACCAT 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT 
CAAACAAAAA TTTATTATGA AGTGAAATCT 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG 
TGGCACGCTA TTACNAANTA TGACCTTAAA 
GATATTTCTG CCTACATTCT TTTAGAAAAC 
AATCAAGCAT TATTGGCNGC NTTAAATGAA 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT 
AAACCAACCA AAGCCGTTCA TAACAAGAAA 
CGTGGTGATG TTCTTTCTTA TGAAATGACN 
GCCTTTGATA CAGTCGATCT TGCGACAGGC 
AANGTGACAC CAATCAAAGA CTTACTTCGT 
AACCAGTTCA CGATCTCNTG GGACGATGCC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT 
CGAATTAAAA CCAATACNGT TGTCAACCAT 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA 
TTCTTCTATG AATTTACAAG TAGTGACATT 
TGGTCGATTA GCGATAAACT AGACGTCAAA 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC 
TCGAAACTAT TCACGATGAC CTTTGAACAA 
TTTTTNGATG CGATGAATCT AAAAGAAAAC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG 
CCAGAAAAAA CAGTGATTGT ACCACCAACA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC 
AAACGTAGAA AAGAAACAAA ATAA 



"TTATCAAGC AGACGCAAGT 
GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
CAAAACACCG TCACAGCAAT GATGAAAACC 
GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
AATACAGCTG TTCAGCTGAC AAANGATGGN 
GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
TCCGAACGTC CAGCNAACTA TGGCGGAATN 
GACACGACCC ATGATCGTTT CACAGGNAAA 
GTAGGGGANA AAACGTTAAA AGCAGGAACA 
AAAGACAATA AAGACTTGAC GTTTACNATG 
GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
ACAGGTGACG TAGAAAACAC GCAAACAGAA 
ACNGTGGTGA CGCATACNCC TGATGATCCA 
GGGGAAGANA TTAANCATGG AAAAGTNGCT 
TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GTTTCTTTCT TCGATGATTA CGATGAAACG 
GTCAAAGATT CTAAAGGGGN AGACATTACG 
AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CAAGAATTGC GTGTAACNCT CCCTACAAAA 
AATTCAGCGG AACAAAATAC ATTTGGNCAA 
ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
AATGGNGCCA CAATCAAATT AGGGGAGAAN 
CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
GGAACCAAAG TGAATAAAGG GGACGACATT 
GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
AAAAACGTTG CACACTCATG GAAAGCGTTC 
GTTTACAACA CAATCGAAGA ATCTTTCAAC 
ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 



EF064-4 (SEQ ID NO: 244) 



VTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD 
ISAYILLENK DNKDLTFTMN QALLAALNEG 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN 



NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
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IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 

ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 

GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASV 



EF065-1 (SEQ ID NO:245) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGaCAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CC AC TAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
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GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF065-2 (SEQ ID NO:246) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
I PKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF065-3 (SEQ ID NO:247) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTG AAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT. TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
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CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF065-4 (SEQ ID NO:248) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEEL.APYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 

EF066-1 (SEQ ID NO:249) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
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GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGG AAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF066-2 (SEQ ID NO:250) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
I PKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
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PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF066-3 (SEQ ID NO:251) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCA 

EF066-4 (SEQ ID. NO:252) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TI SATS TEG Y VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVT 

EF067-1 (SEQ ID NO:253) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CC AC ATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
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AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATG1TA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF067-2 (SEQ ID NO:254) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
I PKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 



WO 98/50554 PCT/US98/08959 

152 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF067-3 (SEQ ID NO:255) 

GCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 

GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAAC TTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF067-4 (SEQ ID NO:256) 

VLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 



EF068-1 (SEQ ID NO:257) 

TAGGGGAAGC TAATGATCTT GGTATTTATC 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT 
CGCAAGATGC TTAATTTAGC AATATCAAGT 
GTAAGTATAG CTGTTACCTC TGGCACAATC 
CTATTATCAA ATGTTACGTC AAATAATGAC 
GCCGCAAACC AAAATCAACC AGTTAATTTC 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC 



GTTTATTTTA AAGAAAAGAG GGACGATCAG 
CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
GGTTTATTAT TTACGTCATT AGCAATCCCT 
AGTGCATCAG CAGCGGTCTT GGATATCGAA 
AGTGGCACTT CAACGAGTAA TCGTTGGACA 
ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATACCAATG TCACGATTGA TCTTTCAAAA 
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GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
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ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTGC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF068-2 (SEQ ID NO:258) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TI IGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 
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EF068-3 (SEQ ID NO : 259 ) 

CTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 

CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC 



EF068-4 (SEQ ID NO:260) 

TSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGWELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTP 

EF069-1 (SEQ ID NO:261) 

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG 

ATGAAAAAGA AAATTGTTGA GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA 

CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT 

GTAAGTATAG CTGTTACCTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 

CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 

GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 

TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 

AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 

GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
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ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGG AAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCG AT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
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GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF069-2 (SEQ ID NO:262) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTS DANG DFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 



EF069-3 (SEQ ID NO:263) 
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AGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 

GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAA 



EF069-4 (SEQ ID NO:264) 

AGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGT 

EF070-1 (SEQ ID NO:265) 

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT 
GTAAGTATAG CTGTTACCTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 
CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
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CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GG AC TGCTG A AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
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ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF070-2 <SEQ ID NO:266) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMG I IKRKRKN 

EF070-3 (SEQ ID NO:267) 

CGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
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GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTC TAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACT 



EF70-4 (SEQ ID NO:268) 
DGDGNE SQPTEVTVPE 

DATVAAPTVT TVTGTTATGY QVTGTAEPNV 
TATANEALTA IAKDAAGKES NPTAFKTPAD 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI 
TPGETITIIS KDGAGNESQP ATAVIPADW 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP 
TAGQATAQQS LLATATDGAG HYSTATTFMT 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS 
YL 



TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
PDAPVATPTV DKITGSTTNG YQWGAAEVG 
GKASANETIT WAKNATGKE SQPATATTPV 
DVRDADGTII AATTANETGQ YTVTLPAGW 
LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
EKETLTALTT DTQGNVSPKT TFMTPADITG 
NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 



EF071-1 (SEQ ID NO:269) 

TAAGTAGAAG TGGTCGGGAC AAACGTAGAA 
GTCCCGCCAT TTATCTGCAG GTTTAAGCCG 
ATGGCTTTTT TAAGAAAGGA GCATGCTATG 
GTGATTGGTT TAAGTTTAAC GATTCCGATG 
CCAATCAACT TTAC TTATTT TCCCGGCTCT 
TCTGGAAACG AGCGGAACCT AGGACCACAC 
CGAAATTGGT CAAATGCTTA TGTCTCATAT 



CTTTCGCTGA TTGCCGAAGA AATTACTTCT 
TGGAAGGGAA GTTATTTTGA CTTTCCTTTC 
TTTAAAAAAT TAATGATTCA ACTTGCTTTA 
ACGGCTTNCG CTTACACCAT CGAAGCGGAT 
GCAAGCAATG AATTAATTGT TTTACATGAA 
AGTTTAGACA ATGAAGTGGC CTATATGAAA 
TTTGTCGGAT CTGGTGGACG AGTGAAACAA 
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TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT 
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC 
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT 
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGGTA A 



EF071-2 (SEQ ID NO:270) 
MF KKLMIQLALV 

IGLSLTIPMT AXAYTIEADP INFTYFPGSA 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW 
TGVSXTGETG HYSAR 

EF071-3 (SEQ ID NO:271) 

G TTTAAAAAAT TAATGATTCA ACTTGCTTTA 

GTGATTGGTT TAAGTTTAAC GATTCCGATG ACGGCTTNCG CTTACACCAT CGAAGCGGAT 
CCAATCAACT TTACTTATTT TCCCGGCTCT GCAAGCAATG AATTAATTGT TTTACATGAA 
TCTGGAAACG AGCGGAACCT AGGACCACAC AGTTTAGACA ATGAAGTGGC CTATATGAAA 
CGAAATTGGT CAAATGCTTA TGTCTCATAT TTTGTCGGAT CTGGTGGACG AGTGAAACAA 
TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT 
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC 
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT 
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGGT 

EF071-4 (SEQ ID NO:272) 

F KKLMIQLALV 

IGLSLTIPMT AXAYTIEADP INFTYFPGSA SNELIVLHES GNERNLGPHS LDNEVAYMKR 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA GSLANQKAYA QIELARTNNA ATFKKDYAAY 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWWGDHT DPYGYLARGG LVKRIGTRFT 
TGVSXTGETG HYSAR 

EF072-1 (SEQ ID NO:273) 

TAATCAATGA AAAACGCACG TTGGTTAAGT ATTTGCGTCA TGCTACTCGC TCTTTTCGGG 
TTTTCACAGC AAGCATTAGC AGAGGCATCG CAAGCAAGCG TTCAAGTTAC GTTGCACAAA 
TTATTGTTCC CTGATGGTCA ATTACCAGAA CAGCAGCAAA ACACAGGGGA AGAGGGAACG 
CTGCTTCAAA ATTATCGGGG CTTAAATGAC GTCACTTATC AAGTCTATGA TGTGACGGAT 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG AGGCACAGCG TCAATTAGCA 
GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA AAACACAGAC AATAAATGGA 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT CGCAGCAACG AGATAAAGCC 
TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG AAAAAGCTAG CAACCTAGTA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA CGCATATTCA TTTATATCCA 
AAAAATGAAG AAAATGCCTA TGACTTACCA CCACTTGAAA AAACGGTACT CGATAAGCAA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA GATAAGGCGG ATACAACGTT GACACTTTTA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA CTACAGGTTA CACACTGACG 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG ACTTACAAAA CTTTGCAAAT 



SNELIVLHES GNERNLGPHS LDNEVAYMKR 
GSLANQKAYA QIELARTNNA ATFKKDYAAY 
ITKNWWGDHT DPYGYLARGG LVKRIGTRFT 
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CAAACAATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA CCGCTGAACC TGACACTGCG 
ATTAACAACG AAGGACAATT AGTCACGGAC AAACATACCT TGACTAAAAG AGCCACAGTT 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG GGGGAATACC TCAATGAAAC AGCAAACGGG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA CGTCTAATCA AGCCGGTGAA 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA GTACTTCTTG GAAGAAATCT CTGCACCAAA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG GGAAAAAATT CTTATGCAAC 
GAACGGACAA CGAACAGCAC CGTTACATGT AATCAATAA 



EF072-2 (SEQ ID NO:274) 



MKNARWLS I CVMLLALFGF SQQALAEASQ ASVQVTLHKL LFPDGQLPEQ • QQNTGEEGTL 
LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT VQEAQRQLAE TGATNRKP I A EDKTQTINGE 
DGWSFSLAS KDSQQRDKAY LFVEAEAPEV VKEKASNLW ILPVQDPQGQ SLTHIHLYPK 
NEENAYDLPP LEKTVLDKQQ GFNQGEHINY QLTTQIPANI LGYQEFRLSD KADTTLTLLP 
ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS IKDLQNFANQ TMTVSYQMRL EKTAEPDTAI 
NNEGQLVTDK HTLTKRATVR TGGKSFVKVD SENAKITLPE AVFIVKNQAG EYLNETANGY 
RWQKEKALAK KFTSNQAGEF SVKGXKRWPV LLGRNLCTKR LSSESNRNSF YGGKKFLCNE 
RTTNSTVTCN Q 



EF072-3 (SEQ ID NO:275) 



ATTACCAGAA CAGCAGCAAA ACACAGGGGA 
CTGCTTCAAA ATTATCGGGG CTTAAATGAC 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA 
GAAACCGGTG CAACAAATAG AAAACCGATC 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT 
TATTTATTTG TTGAAGCGGA AGCACCAGAA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG 
AAAAATGAAG AAAATGCCTA TGACTTACCA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT 
CAAACAATGA CTGTGTCGTA TCAAATGCGT 
ATTAACAACG AAGGACAATT AGTCACGGAC 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC 
GAACGGACAA CGAACAGCAC CGTTACATGT 



AGAGGGAACG 

GTCACTTATC AAGTCTATGA TGTGACGGAT 
ACGGTCCAAG AGGCACAGCG TCAATTAGCA 
GCAGAAGATA AAACACAGAC AATAAATGGA 
AGCAAAGATT CGCAGCAACG AGATAAAGCC 
GTGGTAAAGG AAAAAGCTAG CAACCTAGTA 
CAATCGTTAA CGCATATTCA TTTATATCCA 
CCACTTGAAA AAACGGTACT CGATAAGCAA 
TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
GATAAGGCGG ATACAACGTT GACACTTTTA 
AAAACAGTTA CTACAGGTTA CACACTGACG 
TCAATTAAAG ACTTACAAAA CTTTGCAAAT 
TTAGAAAAGA CCGCTGAACC TGACACTGCG 
AAACATACCT TGACTAAAAG AGCCACAGTT 
GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GGGGAATACC TCAATGAAAC AGCAAACGGG 
AAAAAATTCA CGTCTAATCA AGCCGGTGAA 
GTACTTCTTG GAAGAAATCT CTGCACCAAA 
TTTTACGGTG GGAAAAAATT CTTATGCAAC 
A 



EF072-4 (SEQ ID NO:276) 



QLPEQ QQNTGEEGTL 
LQNYRGLNDV TYQVYDVTDP 
DGWSFSLAS KDSQQRDKAY 
NEENAYDLPP LEKTVLDKQQ 
ESIEVKVAGK TVTTGYTLTT 
NNEGQLVTDK HTLTKRATVR 
RWQKEKALAK KFTSNQAGEF 
RTTNSTVTC 



FYQLRSEGKT VQEAQRQLAE 
LFVEAEAPEV VKEKASNLW 
GFNQGEHINY QLTTQIPANI 
QKHGFTLDFS IKDLQNFANQ 
TGGKSFVKVD SENAKITLPE 
SVKGXKRWPV LLGRNLCTKR 



TGATNRKP I A EDKTQTINGE 
ILPVQDPQGQ SLTHIHLYPK 
LGYQEFRLSD KADTTLTLLP 
TMTVSYQMRL EKTAEPDTAI 
AVFIVKNQAG EYLNETANGY 
LSSESNRNSF YGGKKFLCNE 



WO 98/50554 



PCTAJS98/089S9 



164 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 



EF073-1 (SEQ ID NO:277) 

TAAATGAACA AATTAAATAC AAAATTACTG ATTGGCTATA TTCTTTTAGG AGCCTTAATC 
ATTGCTGTCG CTAGAGAATA TGGCTTCTTC GCTTTTGTGA TTCTGGTAGG CTTTTTAGTA 
TTCGTTCTCT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT 
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA 
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG 
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATG AC TTAG A CGATTTAGAT 
GTCGAAATGT CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT 
CAAGTAAACG AAGACCAGCA ATAA 

EF073-2 (SEQ ID NO:278) 

MNKLNTKLLI GYILLGALII AVAREYGFFA FVILVGFLVF VLYRKKKNAA DKSDQMPYLT 
KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ IIQLQENMNR STKLRAIDLR NDTTKVSKAL 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS QKAATEESPQ VNEDQQ 

EF073-3 (SEQ ID NO:279) 

CT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT 
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA 
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG 
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CGATTTAGAT 
GTCGAAATGT CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT 
CAAGTAAACG AAGACCAGCA AT 



EF073-4 (SEQ ID NO:280) 

YRKKKNAA DKSDQMPYLT 
KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS 

EF074-1 (SEQ ID NO:281) 



IIQLQENMNR STKLRAIDLR NDTTKVSKAL 
KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS 
QKAATEESPQ VNEDQQ 



TAAAGGAGTT CTCAAAAAAT GAAGCTAAAA AAAATAATTC CTGCTTTTCC CCTTCTTTCA 
ACCGTTGCAG TTGGCTTGTG GTTAACGCCT ACTCAAGCTT CTGCAGATGC TGCGGATACG 
ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAACTG GGCCTCAAAA 
GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA 
GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA 
GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAACCA AACCGTCATC 
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CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 
ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 
TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 
GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCTATN ATATT 

EF074-2 (SEQ ID NO:282) 

MKLKK IIPAFPLLST VAVGLWLTPT QASADAADTM VDISGKKVLV GYWHNWASKG 
RDGYKQGTSA SLNLSEVNQA YNWPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
GRAVLLALGG ADAHIQLVKG DEQAFANEII RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
WVDEVMTWVA QSNDALKYEF LYXI 

EF074-3 (SEQ ID NO:283) 

TGC TGCGGATACG 

ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAACTG GGCCTCAAAA 
GG AC GCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA 
GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA 
GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAACCA AACCGTCATC 
CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 
ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 
TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 
GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCT 

EF074-4 (SEQ ID NO:284) 
AADTM VDISGKKVLV GYWHNWASKG 

RDGYKQGTSA SLNLSEVNQA YNWPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
GRAVLLALGG ADAHIQLVKG DEQAFANEII RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
WVDEVMTWVA QSNDALKYEF LY 



EF075-1 (SEQ ID NO:285) 

TAACCTATAA GAAAAAAATC ACAACCTGTG 
GGGAAGAAAA TTTTTGCCAT TATCNTTGGA 
GGAATGGGAG CAAAACTTTA TTGGGATGTT 
GTAGAACGAT CTAAAAAAAG TCAGGTCAAT 
TTATTAGGGA TTGATACAGG CGATGATGGG 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC 
TTGGATGGTG AACAAGCACT CTCCTATTCA 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG 
CTTAACAGCG TAAGCAACTA TCAAGAAATT 
GATTTAAGTT TTGATGACAT GAAAAAAATT 



ATAAATTATT GGAGGNAAAA TATGTCAAAA 
ATTATCTTGG NTCTATTTCT TGCAGTTGTT 
TCTAAATCAA TGGATAAAAC CTATGAAACA 
TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
CGTGTCGAGC AAGGTCGTTC GGATACAACA 
CAAACAACCT TAGTCAGTCT TGCTCGCGAT 
CAAGATAAAT TGAATCACGC CTATGCTTTT 
GAAAACTATT TAAACATACC TATTAATCAT 
GAATTAGTCA ACGCGGTTGG CGGAATCGAA 
GGATATGATT TTACGATTGG TAAAATTTCA 
AGAATGCGTT ACGAAGACCC TAATGGTGAC 
ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
TTAACAGCTG TTTCTGATAA TATGAAGACA 
GCCTTAGATT ATCGCAGTGC CTTTGGTAAA 
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GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTTC CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 
TAA 

EF075-2 (SEQ ID NO:286) 

MSKG KKIFAIIXGI ILXLFLAWG MGAKLYWDVS KSMDKTYETV 

ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPRDKQ TTLVSLARDT 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NYLNIPINHY VSINMAGLKE LVNAVGGIEV 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI EGIVQKVLSL 
NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA LDYRSAFGKV KQDQLQGTGF MQDGVSYQRV 
DEQELTRVQQ ELKNQLNTK 



EF075-3 (SEQ ID NO: 287) 

ACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA 

GTAGAACGAT CTAAAAAAAG TCAGGTCAAT TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
TTATTAGGGA TTGATACAGG CGATGATGGG CGTGTCGAGC AAGGTCGTTC GGATACAACA 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG CAAACAACCT TAGTCAGTCT TGCTCGCGAT 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA CAAGATAAAT TGAATCACGC CTATGCTTTT 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT GAAAACTATT TAAACATACC TATTAATCAT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA GAATTAGTCA ACGCGGTTGG CGGAATCGAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC GGATATGATT TTACGATTGG TAAAATTTCA 
TTGGATGGTG AACAAGCACT CTCCTATTCA AGAATGCGTT ACGAAGACCC TAATGGTGAC 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
CTTAACAGCG TAAGCAACTA TCAAGAAATT TTAACAGCTG TTTCTGATAA TATGAAGACA 
GATTTAAGTT TTGATGACAT GAAAAAAATT GCCTTAGATT ATCGCAGTGC CTTTGGTAAA 
GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTTC CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 



EF075-4 (SEQ ID NO:288) 
KLYWDVS KSMDKTYETV 

ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR 
NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA 
DEQELTRVQQ ELKNQLNTK 



VEQGRSDTTI VATVNPRDKQ TTLVSLARDT 
NYLNIPINHY VSINMAGLKE LVNAVGGIEV 
MRYEDPNGDY GRQERQRKVI EGIVQKVLSL 
LDYRSAFGKV KQDQLQGTGF MQDGVSYQRV 



EF076-1 (SEQ ID NO:289) 

TAGAAAATAA CAGAGGAGCT GAAGGAAATG 
AGCATTGCTG CAGTTGCAAG TGTCTCTGTT 
AAGGTATCTC ATGTTTCCAA TCGTTATAAA 
GGAAACCAAA AATTATTATC GATTGTCGAT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC 
GTTAAAGACA ATACAGATTC TTTAAAAGAA 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT 
TAA 



AAAGCATCAA CAAAAATTGG TATCGGTTTA 
GCAGTCATCG CTTCTGAAAA AATTATTAAG 
GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GATTTATCCG ATGATGAATT AGATTCTGTT 
GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
CGCTTTTTCA CATTTATTGA AGATGCAATG 
TTTTATAAAA ATAATTCTTT TGTTTCAACA 
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EF076-2 (SEQ ID NO:290) 

MK ASTKIGIGLS IAAVASVSVA VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 
NQKLLSIVDD LSDDELDSVL NWDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNSFVST 

EF076-3 (SEQ ID NO:291) 

CATCG CTTCTGAAAA AATTATTAAG 

AAGGTATCTC ATGTTTCCAA TCGTTATAAA GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GGAAACCAAA AATTATTATC GATTGTCGAT GATTTATCCG ATGATGAATT AGATTCTGTT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
GTTAAAGACA ATACAGATTC TTTAAAAGAA CGCTTTTTCA CATTTATTGA AGATGCAATG 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT TTTTATAAAA ATAATTCTT 



EF076-4 (SEQ ID NO:292) 

VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 
NQKLLSIVDD LSDDELDSVL NWDRVKDGG 
LKKWPRPSFF YKNNS 

EF077-1 (SEQ ID NO:293) 



SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 



TAATGTAAAG TGAATGATGG GAGAGAAAAA GAGATGAAGC ATGTAACAAA ATTGGGGATT 
ACAATTATAA CAGGAGTTTT GGCATTATTA TTTGAATTTA TTTTACATCA GCCGAATTGG 
GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAATGATGTT CTGGGAAATG 
ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAGCGATTAC CGCTATCGTT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTTAATTAT GTTGACTGGT 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA GTCATTATTG 
GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAACTAGT TCCAGTTGAT 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CATTAACAGG AGAATCAAAA 
CCAATTGAAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT TGTGAACTTA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGCAGATCG CTATGCGGTA 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTCGTTAT TAAATCGGGA 
ACGATGGTCG AAAAATTAGC TTCTGCAAAA ACGATTGCGT TTGATAAAAC AGGCACGATT 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTGGAAT AACTGCTGCT 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA GTCATATTTT AGCTAGATCA 
ATTGTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AGATACGGGT AGGTAAAAAG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC TATTCATATT 
TCACGTAATG GCACATATTT AGGCCGAATT ACTTTTACAG ACACTGTACG CCCAGAAGCA 
AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA AGTACATGGG 
GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GTCATCATGG TAGGAGATGG TGTAAATGAT GCACCTTCGC TTGCTGCTGC AGACGTAGGT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTGCTGACGT TGTTATTTTA 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAAGATAC CATGAAAATT 
GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAATGTT AATTGCTAGT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTCGTGGA CACTGTGTCA 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCCAGTAA 
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EF077-2 (SEQ ID NO:294) 

MKHVTKLGIT IITGVLALLF EFILHQPNWA YGIILITGSV MALMMFWEMI 
QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELW KPGELVPVDG LVKTGTSTVD ESSLTGESKP 
IEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP 
FTLVAYLIAG VAWFVSKSPT RFAEVLWAS PCPLILSAPI ALVAGMGRSS RHGWIKSGT 
MVEKLASAKT IAFDKTGTIT QGQLSVDQVQ PINAGITAAE LVGLAASVEQ ESSHILARSI 
VAYARKQDVP LKNITDLAEV SGAGVKAFVD GAEIRVGKKN FVTQESQETE KIDKTTIHIS 
RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH LQRILMLTGD QESVAETIAA EVGITEVHGE 
CLPQDKLTIL KELPKENHPV IMVGDGVNDA PSLAAADVGI AMGAHGATAA SETADWILK 
DDLSKVSQAV EIAQDTMKIA KQSVLIGIFI CVLLMLIAST GIIPALIGAM LQEWDTVS I 
LSALRARRIG Q 

EF077-3 (SEQ ID NO:295) 

TCA GCCGAATTGG 

GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAATGATGTT CTGGGAAATG 
ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAGCGATTAC CGCTATCGTT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTTAATTAT GTTGACTGGT 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA GTCATTATTG 
GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAACTAGT TCCAGTTGAT 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CATTAACAGG AGAATCAAAA 
CCAATTGAAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT TGTGAACTTA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGCAGATCG CTATGCGGTA 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTCGTTAT TAAATCGGGA 
ACGATGGTCG AAAAATTAGC TTCTGCAAAA ACGATTGCGT TTGATAAAAC AGGCACGATT 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTGGAAT AACTGCTGCT 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA GTCATATTTT AGCTAGATCA 
ATTGTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AGATACGGGT AGGTAAAAAG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC TATTCATATT 
TCACGTAATG GCACATATTT AGGCCGAATT ACTTTTACAG ACACTGTACG CCCAGAAGCA 
AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA AGTACATGGG 
GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GTCATCATGG TAGGAGATGG TGTAAATGAT GCACCTTCGC TTGCTGCTGC AGACGTAGGT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTGCTGACGT TGTTATTTTA 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAAGATAC CATGAAAATT 
GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAATGTT AATTGCTAGT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTCGTGGA CACTGTGTCA 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCC 

EF077-4 (SEQ ID NO:296) 

QPNWA YGIILITGSV MALMMFWEMI 

QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELW KPGELVPVDG LVKTGTSTVD ESSLTGESKP 
IEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP 
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FTLVAYLIAG VAWFVSKSPT RFAEVLWAS PCPLILSAPI ALVAGMGRSS RHGWIKSGT 
MVEKLASAKT IAFDKTGTIT QGQLSVDQVQ PINAGITAAE LVGLAASVEQ ESSHILARSI 
VAYARKQDVP LKNITDLAEV SGAGVKAFVD GAEIRVGKKN FVTQESQETE KIDKTTIHIS 
RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH LQRILMLTGD QESVAETIAA EVGITEVHGE 
CLPQDKLTIL KELPKENHPV IMVGDGVNDA PSLAAADVGI AMGAHGATAA SETADWILK 
DDLSKVSQAV EIAQDTMKIA KQSVLIGIFI CVLLMLIAST GIIPALIGAM LQEWDTVSI 
LSALRARRIG 



EF079-1 (SEQ ID NO:297) 



TAATTTCTAG CATCACCGAA GAAATTTTTA 
CCCAGGCTCT CATGCTTTAT TTTTAAGGAG 
ATCATTGATG GTTTTATGAT TCTTTTACTG 
TTTGTTAGCG ATGCATTAAA TAACTATCTG 
AAAGCAAGCC AAGAAAACAC CAAAGAAATG 
AACCAAGAAT TAGCGAAAAA AGGCAGCAAT 
AAAACAACGA AAAAACCAGA CAAATCCTAT 
ATTCCAAAAA TAAATGTCCG TTTACCAATT 
AAAGGAAGCT CCTTGTTAGA AGGAACCTCC 
GTCATTTCAG GCCATCGTGG TCTCCCTCAA 
AAAAAAGGCG ATGAATTTTA TATCGAAGTC 
CAAATAAAAA CCGTTGAACC AACTGATACA 
CTCGTCACTT TATTAACTTG CACACCGTAT 
GGACATCGTA TCCCATATCA ACCAGAAAAA 
CAACAAAATT TACTATTATG GACATTACTT 
TTCATTATCT GGTACAAGCG ACGGAAAAAG 



GAAAAACAAA GAGCCTGGGC CAATCACTGT 
GAAGCAATGA AGTCAAAAAA GAAACGTCGT 
ATTATTGGAA TAGGTGCATT TGCGTATCCT 
GATCAACAAA TTATCGCTCA TTATCAAGCA 
GCTGAACTTC AAGAAAAAAT GGAAAAGAAA 
CCTGGATTAG ATCCTTTTTC TGAAACGCAA 
TTTGAAAGTC ATACGATTGG TGTTTTAACC 
TTTGATAAAA CGAATGCATT GCTATTGGAA 
TATCCTACAG GTGGTACGAA TACACATGCG 
GCCAAATTAT TTACAGATTT GCCAGAATTA 
AATGGGAAGA CGCTTGCTTA TCAAGTAGAT 
AAAGATTTAC ACATTGAGTC TGGCCAAGAT 
ATGATAAACA GTCATCGGTT ATTAGTTCGA 
GCAGCAGCGG GGATGAAAAA AGTGGCACAA 
TTAATTGCCT GTGCGTTAAT TATTAGCGGC 
ACGACCAGAA AACCAAAGTA G 



EF079-2 (SEQ ID NO:298) 



MKSKKKRRI IDGFMJj_sLLI IGIGAFAYPF 
VSDALNNYLD QQ I I AHYQAK ASQENTKEMA 
TTKKPDKSYF ESHTIGVLTI PKINVRLPIF 
ISGHRGLPQA KLFTDLPELK KGDEFYIEVN 
VTLLTCTPYM INSHRLLVRG HRIPYQPEKA 
I IWYKRRKKT TRKPK 



ELQEKMEKKN QELAKKGSNP GLDPFSETQK 
DKTNALLLEK GSSLLEGTSY PTGGTNTHAV 
GKTLAYQVDQ IKTVEPTDTK DLHIESGQDL 
AAGMKKVAQQ QNLLLWTLLL IACALIISGF 



EF079-3 (SEQ ID NO:299) 



TCCT 

TTTGTTAGCG ATGCATTAAA 
AAAGCAAGCC AAGAAAACAC 
AACCAAGAAT TAGCGAAAAA 
AAAACAACGA AAAAACCAGA 
ATTCCAAAAA TAAATGTCCG 
AAAGGAAGCT CCTTGTTAGA 
GTCATTTCAG GCCATCGTGG 
AAAAAAGGCG ATGAATTTTA 
CAAATAAAAA CCGTTGAACC 
CTCGTCACTT TATTAACTTG 
GGACATCGTA TCCCATATCA 
CAACAAAATT TACTATTATG 
TTCATTATCT GGTACAAGCG 



TAACTATCTG GATCAACAAA 
CAAAGAAATG GCTGAACTTC 
AGGCAGCAAT CCTGGATTAG 
CAAATCCTAT TTTGAAAGTC 
TTTACCAATT TTTGATAAAA 
AGGAACCTCC. TATCCTACAG 
TCTCCCTCAA GCCAAATTAT 
TATCGAAGTC AATGGGAAGA 
AACTGATACA AAAGATTTAC 
CACACCGTAT ATGATAAACA 
ACCAGAAAAA GCAGCAGCGG 
GACATTACTT TTAATTGCCT 
ACGGAAAAAG ACGACCAGAA 



TTATCGCTCA TTATCAAGCA 
AAGAAAAAAT GGAAAAGAAA 
ATCCTTTTTC TGAAACGCAA 
ATACGATTGG TGTTTTAACC 
CGAATGCATT GCTATTGGAA 
GTGGTACGAA TACACATGCG 
TTACAGATTT GCCAGAATTA 
CGCTTGCTTA TCAAGTAGAT 
ACATTGAGTC TGGCCAAGAT 
GTCATCGGTT ATTAGTTCGA 
GGATGAAAAA AGTGGCACAA 
GTGCGTTAAT TATTAGCGGC 
AACCAA 



EF079-4 (SEQ ID NO:300) 
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PF 

VSDALNNYLD QQIIAHYQAK ASQENTKEMA ELQEKMEKKN QELAKKGSNP GLDPFSETQK 
TTKKPDKSYF ESHTIGVLTI PKINVRLPIF DKTNALLLEK GSSLLEGTSY PTGGTNTHAV 
ISGHRGLPQA KLFTDLPELK KGDEFYIEVN GKTLAYQVDQ IKTVEPTDTK DLHIESGQDL 
VTLLTCTPYM INSHRLLVRG HRIPYQPEKA AAGMKKVAQQ QNLLLWTLLL IACALIISGF 
I IWYKRRKKT TRKP 

EF080-1 (SEQ ID NO:301) 

TAGTTACACT CGTTTAGGGC TAGCAACGTT AGGCATTTTC GCTGGACTCT TAGCACTCTT 
TTTATTAGGA GGTTATTTCC TATGAAAAAA CGACTTTTAC CTATTTTTTT CCTAATACTT 
CTTACCTTTG GCCTTCCCCT ACCCGTTTCG GCGGCTGAAA ATTCAATTGA TGATGGCGCA 
CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAAG AGATACAACC TTTAGAAGAA 
AAAACAAAAG CCTCTGTCTT TATTGTAACC ACAAATAATA ATACCTATGG CGATGAACAA 
GAATATGCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG ACCAAAATGC GATTCTTTTT 
CTCATTGATA TGGACTTACG GAAAATCTAC ATCTCTACTT CTGGAAACAT GATTGATTAT 
ATGACAGATG CACGAATTGA TGATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA 
AATTATTTCG CGGCTGCTCA AACCTTTGTT CAGGAAACTC AAGCATTTGT TAATAAAGGG 
GTTCCTGGGG GGCACTATCG TGTGGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC 
ATTACCCCGC TGGAAATGGT AATTGCTTTT GCTGCTGCGC TGATACTCAG TTTGGTCTTC 
TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GTTATCAATA TCCCTTTAGG 
GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT TAACCAACTC TTTCATCACT 
ACGCGTCGTA TTCCTAAAAA CAATGGCGGC AGTGGCGGAA TGGGCGGTGG TGGTAGCACC 
ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC GAAGTTTTTA G 



EF080-2 (SEQ ID NO:302) 



MKKR LLPIFFLILL TFGLALPVSA AENSIDDGAQ 

LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKDQNAILFL 
IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFVQ ETQAFVNKGV 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISKYQLK FSSYQYPFRE 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS GGMGGGGSTT HSTGGGTFGG GGRSF 



EF080-3 ( SEQ ID NO:303) 



GGCTGAAA ATTCAATTGA TGATGGCGCA 
CAATTACTGA CACCTGATCA AATCAACCAA 
AAAACAAAAG CCTCTGTCTT TATTGTAACC 
GAATATGCAG ATCATTATCT TTTAAATAAA 
CTCATTGATA TGGACTTACG GAAAATCTAC 
ATGACAGATG CACGAATTGA TGATACCTTA 
AATTATTTCG CGGCTGCTCA AACCTTTGTT 
GTTCCTGGGG GGCACTATCG TGTGGACAGC 
ATTACCCCGC TGGAAATGGT AATTGCTTTT 
TTAGGCATTA ATATTTCTAA ATATCAATTA 
GAAAAAACAA CTTTAAACTT AACCTCCCGC 
ACGCGTCGTA TTCCTAAAAA CAATGGCGGC 
ACCCACTCAA CTGGCGGCGG CACATTCGGT 



CTAAAGCAAG AGATACAACC TTTAGAAGAA 
ACAAATAATA ATACCTATGG CGATGAACAA 
GTTGGCAAGG ACCAAAATGC GATTCTTTTT 
ATCTCTACTT CTGGAAACAT GATTGATTAT 
GATAAAATAT GGGATAATAT GAGTCAAGGA 
CAGGAAACTC AAGCATTTGT TAATAAAGGG 
GAAACAGGTA AAATCACTCG TTATAAAGTC 
GCTGCTGCGC TGATACTCAG TTTGGTCTTC 
AAATTTTCAA GTTATCAATA TCCCTTTAGG 
ACAGATCAGT TAACCAACTC TTTCATCACT 
AGTGGCGGAA TGGGCGGTGG TGGTAGCACC 
GGCGGCGGTC GAAGT 



EF080-4 (SEQ ID NO:304) 



LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKDQNAILFL 
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IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFVQ ETQAFVNKGV 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISKYQLK FSSYQYPFRE 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS GGMGGGGSTT HSTGGGTFGG GGRS 

EF081-1 (SEQ ID NO: 305) 

TGAATGGAAC GAAGCAATCG TAATAAAAAA TCTTCAAAAA AACCACTTAT TCTTGGTGTT 
TCTGCCTTGG TTCTAATCGC TGCTGCCGGT GGCGGGTATT ATGCTTATAG TCAATGGCAA 
GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
TATGATACTA AATC TGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATCAATA CACATTTACC 
TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT 
ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC 
GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TGATAAATTC 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC 
TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG GGGGAAGCAN TGCGCAATTA A 



EF081-2 (SEQ ID NO:306) 

MERSNRNKKS SKKPLILGVS ALVLIAAAGG 
QEFDKLPSW QEASLKKNGY DTKSWEKYQ 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA 
RNGSGLAINK VFDEVGWPG KLGSGAEKTA 
VPITVASEPV TELPTGAATK DTESRYYPLG 



GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 
AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
WKPSLIFPDM SGNDKISIQV DNAKRGEIVD 
NIKAFSDKFG VSVDEINQKL SQGWVQADSF 
EAXRN 



EF081-3 (SEQ ID NO: 307) 



T GGCGGGTATT ATGCTTATAG TCAATGGCAA 

GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
TATGATACTA AATCTGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATCAATA CACATTTACC 
TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT 
ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC 
GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TGATAAATTC 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC 
TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG GGGG 



EF081-4 (SEQ ID NO:308) 



G GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 

QEFDKLPSW QEASLKKNGY DTKSWEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNDKISIQV DNAKRGEIVD 
RNGSGLAINK VFDEVGWPG KLGSGAEKTA NIKAFSDKFG VSVDEINQKL SQGWVQADSF 
VPITVASEPV TELPTGAATK DTESRYYPLG 
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EF082-1 (SEQ ID NO:309) 

TAAAAAATGA AAAAGATCGT GCGCATTTCA AGCATTTTGT TCGTTGCTAC GCCTCTTATG 
CTTTTAAATA GTTCAAAAGT TGAAGCAGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 
ATTACGTTTG CTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG 
CCTGTGACAC CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT 
GACTATGTTT CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT 
TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT 
GTTCAAGTAA CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT 
GCGCAATTTG CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT 
TTAGCAGCAA CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA 
ACCTTGGATC CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA 
ATGGGCACTT GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA 
ACTGTTCCTG CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG 
ATTTTGGATG ATACACCACT TTAA 

EF082-2 (SEQ ID NO:310) 

MKKIVRISS ILFVATPLML LNSSKVEAAQ VASIQSNADI TFALDNTVTP PVNPTNPSQP 
VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI 
LDDTPL 

EF082-3 (SEQ ID NO:311) 

AGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 

ATTACGTTTG CTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG 
CCTGTGACAC CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT 
GACTATGTTT CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT 
TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT 
GTTCAAGTAA CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT 
GCGCAATTTG CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT 
TTAGCAGCAA CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA 
ACCTTGGATC CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA 
ATGGGCACTT GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA 
ACTGTTCCTG CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG 
ATTTTGGATG ATACACCACT 

EF082-4 (SEQ ID NO:312) 

AQ VASIQSNADI TFALDNTVTP PVNPTNPSQP 

VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI 
LDDTP 

EF083-1 (SEQ ID NO:313) 

TAATTTAAAA GACAAGGAGA AATAAAAATG AAAAAGAAAA TTTTAGCAGG AGCGCTTGTC 
GCTCTGTTTT TTATGCCTAC AGCTATGTTT GCCGCAAAAG GAGACCAAGG TGTGGATTGG 
GCGATTTATC AAGGTGAACA AGGTCGCTTT GGCTATGCAC ATGATAAATT CGCTATTGCC 
CAGATTGGAG GCTACAATGC TAGCGGTATT TATGAACAAT ACACATATAA AACGCAAGTG 
GCAAGTGCTA TTGCCCAAGG TAAACGTGCG CATACCTATA TTTGGTATGA CACTTGGGGA 
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AACATGGACA TTGCGAAAAC AACAATGGAT TACTTTTTGC CACGTATTCA AACGCCTAAA 

aISSatcg ttgcattaga TTTTGAACAT GGAGCGTTGG ctagtgttcc agatggatat 
SaSSIS tIagttcaga tgccgaaaaa gcagcaaata cagagacaat tttgtacggt 
a?g?gcIgaa tcaaacaggc tggctatact ccaatgtatt acagctataa gccatttaca 
Sa^ctatca acaaatcatc aaagagtttc ctaactcttt atggattgct 
gcg^tccta tcgatggtct gtcaccatat ccattgtatg cttatttccc aagcatggat 
ggSSa SSgcaatt cacatccgct tatattgcag gtggtttaga tggtaacgta 
SEES gIStacgga tagtggttat acagatacca ataaaccaga aacggatacg 
ccIgcaISg atgcaggcga agaaattgaa aaaataccta attctgatgt taaagttggc 

GATACCGTCA AAGTGAAATT TAATGTAGAT GCTTGGGCAA CTGGGGAAGC TATTCCGCAA 

SggtaaIag g^agcta caaagtgcaa GAAGTAACTG GAAGCAGAGT attgcttgaa 

GG^TC^T CATGGATTAG CAAAGGTGAT ATTGAATTAT TGCCAGACGC AACAGTCGTC 

ccSItaa^c ScSgaagc gactcatgtg gtacaatacg gagaaacatt atcaagtatt 
gcSSc^t aIggaacaga ctatcaaacg ttggcggcat taaatggatt ggctaatcca 

" ATCCTGGTCA AGTTTTGAAA GTCAATGGAT CGGCAACAAG TAATGTCTAC 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AACTTGGCAC TACTTATCAA 
gct?^ caSaaacgg ATTAGCAAAT CCTAACTTGA TTTATCCAGG TCAAACATTG 
AATTATTAA 

EF083-2 (SEQ ID NO:314) 

MK KK I LAG ALVA LFFMPTAMFA AKGDQGVDWA IYQGEQGRFG YAHDKFAIAQ 
^GGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTWGN MDIAKTTMDY FLPRIQTPKN 
sSdfehg alLvpdgyg GYVSSDAEKA ANTETILYGM RRIKQAGYTP myysykpptl 
NHVNYQQIIK EFPNSLWIAA YPIDGVSPYP LYAYFPSMDG IGIWQFTSAY I^LDGNVD 
LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSDVKVGD TVKVKFNVDA WATGEAIPQW 
Sgnsykvqe v^rvIleg ilsvoskgdi ELLPDATWP DKQPEATHW QVGETLSSIA 
YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VKYGDNLSSI AAKLGTTYQA 
LAALNGLANP NLIYPGQTLN Y 

EF083-3 (SEQ ID NO:315) 

AAAAG GAGACCAAGG TGTGGATTGG 

GCGATTTATC AAGGTGAACA AGGTCGCTTT GGCTATGCAC ATGATAAATT CGCTATTGCC 
CAGATTGGAG GCTACAATGC TAGCGGTATT TATGAACAAT ACACATATAA AACGCAAGTG 
GcSSgSa TTGCCCAAGG TAAACGTGCG CATACCTATA TTTGGTATGA CACTTGGQOA 
AACATGGACA TTGCGAAAAC AACAATGGAT TACTTTTTGC CACGTATTCA AACGCCTAAA 
AATTCCATCG TTGCATTAGA TTTTGAACAT GGAGCGTTGG CTAGTGTTCC AGATGGATAT 
G^IgGAT^ tUgTTCAGA TGCCGAAAAA GCAGCAAATA CAGAGACAAT rTTGTACGGT 
ATGCGCAGAA TCAAACAGGC TGGCTATACT CCAATGTATT ACAGCTATAA GCCATTTACA 

ctIStcatc taaIctatca acaaatcatc aaagagtttc ctaactcttt atggattgct 

GCGTATCCTA TCGATGGTGT GTCACCATAT CCATTGTATG CTTATTTCCC AAGCATGGAT 
GgSSgGTA TTTGGCAATT CACATCCGCT TATATTGCAG GTGGTTTAGA TGGTAACGTA 
GaStIIcIS gIItTACGGA TAGTGGTTAT ACAGATACCA ATAAACCAGA AACGGATACG 
CC™CAG ATGCAGGCGA AGAAATTGAA AAAATACCTA ATTCTGATGT TAAAGTTGGC 
GATACCGTCA AAGTGAAATT TAATGTAGAT GCTTGGGCAA CTGGGGAAGC TATTCCGCAA 
SStAA^G gIaACAGCTA CAAAGTGCAA GAAGTAACTG GAAGCAGAGT ATTGCTTGAA 
GGTATCTTGT CATGGATTAG CAAAGGTGAT ATTGAATTAT TGCCAGACGC AACAGTCGTC 
CCTGATAAGC AACCAGAAGC GACTCATGTG GTACAATACG GAGAAACATT ATCAAGTATT 
GCTTATCAAT ATGGAACAGA CTATCAAACG TTGGCGGCAT TAAATGGATT ^TAATCCA 
AATCTTATTT ATCCTGGTCA AGTTTTGAAA GTCAATGGAT CGGCAACAAG TAATGTCTAC 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AACTTGGCAC TACTTATCAA 
GCTTTAGCTG CATTAAACGG ATTAGCAAAT CCTAACTTGA TTTATCCAGG TCAAACATTG 
AAT 
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EF083-4 (SEQ ID NO-.316) 

KGDOGVDWA IYQGEQGRFG YAHDKFAIAQ rNT ^ rT1Tlt ,., 
IGGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTWGN MDIAKTTMDY FLPRIQTPKN 

SvIldfehg alasvpdgyg gyvssdaeka antetilygm rrikqagytp myysykpftl 
S efpnslwtaa ypidgvspyp lyayfpsmdg igiwqftsay ia^dgnvd 

LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSDVKVGD TVKVKFNVDA WATGEAIPQW 

™yk5qe vtSrvlleg ilswiskgdi ellpdatwp DKQPEATHW QYGETLSSIA 
™ySqt! aILglanpn liypgqvlkv ngsatsnvyt vkygdnlssi aaklgttyqa 

LAALNGLANP NLIYPGQTLN 
EF084-1 (SEQ ID NO:317) 

TAGTCAAACG TTTATTTTTT CCTTAAATCC AGAAAAAATC CCGTAATTAT GGTACACTAC 
SSSSSS ggIggagaac TATGAAGAAA TTTGATGTAA ttattgtcgg ^ctgggacg 

AGCGGTATGA TGGCCACGAT TGCGGCCGCC GAAGCAGGCG CTCAAGTATT ATTGATTGAA 

ESSSSS SgSgggaa aaaattatta atgactggtg gcggccgctg taatgtaacc 
Jataatcggc ccgcagaaga aatcatttca tttattcctg ggaatggaaa atttttatac 

AGCGCATTTT CACAATTTGA TAACTATGAT ATCATGAACT TTTTTGAATC CAATGGTATT 
CAcSaaIS AAGA^GATCA CGGACGCATG TTCCCTGTTA CAGATAAATC OAAGTCAATT 
GTTGATGCGC TATTTAACCG CATTAACGAA TTAGGAGTCA CTGTTTTTAC AAAAACACAG 
GTCACAAAAT TACTACGAAA AGACGATCAA ATAATTGGCG TTGAAACCGA ACTGGAAAAA 

SaS cgtgtgttgt attaacaact ggcggccgca cttatccttc cacaggagca 
SgSatg gctataaact agccaaaaaa atggggcata ccatcagccc gctctaccct 
accgaatcac ctattatttc tgaagaacct tttatcctgg ataaaacgtt gcaaggtctc 
iSSJSJS atgttaattt aactgttttg aaccaaaaag gaaaaccttt agttaatcat 
SISgga^a tgctgtttac acattttggc atttcaggac ctgccgcgct ccgctgttct 
ag^Satta accaagaatt aactcgcaac ggtaatcaac ctgtcacggt agccttggat 

Sg™ CAAAATCTTT TGAAGAAGTG CCTGCCAAAC AACTAACAGA AAAGCAACGN 
CTTTCCTTTG TGGAACTACT GAAAGACTTT CAGTTCACTG TTACGAAAAC ATOCCTTTC 
GAAAAATCTT TTGTCACAGG CGGTGGGATT TCCCTCAAAG AAGTGACCCC TAAAACAATG 
gaIa^caIIt tagtcaatgg TTTATTTTTT GCTGGTGAAC TITTAGATAT TAATGGCTAT 

acSgagIS acaatgttac agctgcattt gtcactggac atgttgctgg ctcccatgcc 

GCAGAAATTG CAGAATACAC CTATTTACCA ATTGAAGAAG TCTAA 
EF084-2 (SEQ ID NO: 318) 

MKKF DVIIVGAGTS GMMATIAAAE AGAQVLLIEK mm , m „ TU 

NRRVGKKLLM TGGGRCNVTN NRPAEEIISF IPGNGKFLYS AFSQFDNYDI MNFFESNGIH 

lkeedhgrmf pvtdksksiv dalfnrinel gvtvftktqv tkllrkddqi igveteleki 

YAPCWLTTG GRTYPSTGAT GDGYKLAKKM GHTISPLYPT ESPIISEEPF ILDKTLQGLS 

lqdvnltvln qkgkplvnhq mdmLjFTHfgi sgpaalrcss finqeltrng nqpvtvaldv 
FPTOSFEEVP akqltekqrl sfvellkdfq ftvtktlple ksfvtgggis lkevtpktme 

SKLVNGLFFA GELLDINGYT GGYNVTAAFV TGHVAGSHAA EIAEYTYLPI EEV 
EF084-3 (SEQ ID NO:319) 

C GAAGCAGGCG CTCAAGTATT ATTGATTGAA 

AAAAATCGCC GTGTTGGGAA AAAATTATTA ATGACTGGTG GCGGCCGCTG TAATGTAACC 
AATAATCGGC CCGCAGAAGA AATCATTTCA TTTATTCCTG GGAATGGAAA ATTTTTATAC 
AGCGCATTTT CACAATTTGA TAACTATGAT ATCATGAACT TTTTTGAATC CAATGGTATT 
CACTTAAAAG AAGAAGATCA CGGACGCATG TTCCCTGTTA CAGATAAATC GAAGTCAATT 
GTTGATGCGC TATTTAACCG CATTAACGAA TTAGGAGTCA CTGTTTTTAC AAAAACACAG 
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GTCACAAAAT TACTACGAAA AGACGATCAA ATAATTGGCG TTGAAACCGA ACTGGAAAAA 
ATTTATGCAC CGTGTGTTGT ATTAACAACT GGCGGCCGCA CTTATCCTTC CACAGGAGCA 
AC TGGTGATG GCTATAAACT AGCCAAAAAA ATGGGGCATA CCATCAGCCC GCTCTACCCT 
ACCGAATCAC CTATTATTTC TGAAGAACCT TTTATCCTGG ATAAAACGTT GCAAGGTCTC 
TCTTTACAAG ATGTTAATTT AACTGTTTTG AACCAAAAAG GAAAACCTTT AGTTAATCAT 
CAAATGGATA TGCTGTTTAC ACATTTTGGC ATTTCAGGAC CTGCCGCGCT CCGCTGTTCT 
AGTTTTATTA ACCAAGAATT AACTCGCAAC GGTAATCAAC CTGTCACGGT AGCCTTGGAT 
GTGTTTCCGA CAAAATCTTT TGAAGAAGTG CCTGCCAAAC AACTAACAGA AAAGCAACGN 
CTTTCCTTTG TGGAACTACT GAAAGACTTT CAGTTCACTG TTACGAAAAC ATTGCCTTTG 
GAAAAATCTT TTGTCACAGG CGGTGGGATT TCCCTCAAAG AAGTGACCCC TAAAACAATG 
GAGAGCAAAT TAGTCAATGG TTTATTTTTT GCTGGTGAAC TTTTAGATAT TAATGGCTAT 
ACTGGAGGCT ACAATGTTAC AGCTGCATTT GTCACTGGAC ATGTTGCTGG CTCCCATGCC 
GCAGAAATTG CAGAATACAC CTATTTACCA ATTGAAGAAG TC 

EF084-4 (SEQ ID NO: 3 20) 

E AGAQVLLIEK 

NRRVGKKLLM TGGGRCNVTN NRPAEEIISF IPGNGKFLYS AFSQFDNYDI MNFFESNGIH 
LKEEDHGRMF PVTDKSKSIV DALFNRINEL GVTVFTKTQV TKLLRKDDQ I IGVETELEKI 
YAPCWLTTG GRTYPSTGAT GDGYKLAKKM GHTISPLYPT ESPIISEEPF ILDKTLQGLS 
LQDVNLTVLN QKGKPLVNHQ MDMLFTHFGI SGPAALRCSS FINQELTRNG NQPVTVALDV 
FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ FTVTKTLPLE KSFVTGGGIS LKEVTPKTME 
SKLVNGLFFA GELLDINGYT GGYNVTAAFV TGHVAGSHAA EIAEYTYLPI EEV 

EF085-1 (SEQ ID NO:321) 

TAACCCATGA AATCATTTTG TCCCGCATAT GGGGATATGA CTTTGACGGT GATGGCAGCA 
CAGTCCACAC TCATATCAAA AATCTGCGGG CGAACTGCCG GAAAATATCA TCAAAACCAT 
CCGCGGTGTA GGTTACCGAT TGGAGGAATC ATTATAATGG AAAGAAAAGG GATTTTCATT 
AAGGTTTTTT CCTATACGAT CATTGTCCTG TTACTGCTTG TCGGTGTAAC GGCAACACTG 
TTTGCACAGC AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC 
TATCAGCCAT TGGTGGAACT GATTCAGAAT AGCGATAGGC TTGATATGCA AGAGGTGGCA 
GGGCTGTTTC ACTACAATAA CCAATCCTTT GAGTTTTATA TTGAAGATAA AGAGGGAAGC 
GTACTCTATG CCACACCGAA TGCCGATACA TCAAATAGTG TTAGGCCCGA CTTTCTTTAT 
GTGGTACATA GAGATGATAA TATTTCGATT GTTGCTCAAA GCAAGGCAGG TGTGGGATTG 
CTTTATCAAG GGCTGACAAT TCGGGGAATT GTTATGATTG CGATAATGGT TGTATTCAGC 
CTTTTATGCG CGTATATCTT TGCGCGGCAA ATGACAACGC CGATCAAAGC CTTAGCGGAC 
AGTGCGAATA AAATGGCAAA CCTGAAAGAA GTACCGCCGC CGCTGGAGCG AAAGGATGAG 
CTTGGCGCAC TGGCTCACGA CATGCATTCC ATGTATATCA GGCTGAAAGA AACCATCGCA 
AGGCTGGAGG ATGAAATCGC AAGGGAACAT GAGTTGGAGG AAACACAGCG ATATTTCTTT 
GCGGCAGCCT CTCATGAGTT AAAAACGCCC ATCGCGGCTG TAAGCGTTCT GTTGGAGGGA 
ATGCTTCAAA ATATCGGTGA CTACAAAGAC CATTCTAAGT ATCTGCGCGA ATGCATCAAA 
ATGATGGACA GGCAGGGCAA AACCATTTCC GAAATACTGG AGCTTGTCAG CCTGAACGAT 
GGGAGAATCG TACCCATAGC CGAACCGCTG GACATAGGGC GCACGGTTGC CGAGCTGCTA 
CCCGATTTTC AAACCTTGGC AGAGGCAAAC AACCAGCGGT TCGTCACAGA TATTCCAGCC 
GGACAAATTG TCCTGTCCGA TCCGAAGCTG ATCCAAAAGG CGCTATCCAA TGTCATATTG 
AATGCGGTTC AGAACACGCC CCAGGGAGGT GAGGTACGGA TATGGAGTGA GCCTGGGGCT 
GAAAAATACC GTCTTTCCGT TTTGAACATG GGCGTTCACA TTGATGATAC TGCACTTTCA 
AAGCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG 
CGGTTTGGGG CTTGCCATCG TACAAAAAAC GCTGGATGCC ATGAGCCTCC AATATGCGCT 
GGAAAACACC TCAGATGGCG TTTTGTTCTG GCTGGATTTA CCGCCCACAT CAACACTATA 
AATATTTAA 
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EF085-2 (SEQ ID NO:322) 

JSSSL LLVGVTATLF AQQFVSYFRA MEAQQTVKSY QPLVELIQNS DRLDMQEVAG 
LFHYNNQSFE FYIEDKEGSV LYATPNADTS NSVRPDFLYV VHRDDNISIV AQSKAGVGLL 
™HGIV MIAIMWFSL LCAYIFARQM TTPIKALADS A^MANL™ » 
GALAHDMHSM YIRLKETIAR LEDEIAREHE LEETQRYFFA AASHELKTPI AAVSVLLEGM 
^PMIGDYKDH SKYLRECIKM MDRQGKTISE ILELVSLNDG RIVPIAEPLD IGRTVAELLP 

dfqtlaS qrf^tdipS qivlsdpkli QKALSNVILN avqntpqgge vriwsepgae 
kyrlsS ShSdtalsk lfipfyridq arsskkwakr fgachrtkna gcheppicag 

KHLRWRFVLA GFTAHINTIN I 
EF085-3 (SEQ ID NO:323) 

GC AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC 
TATCAGCC AT TGGTGG AAC T GATTCAGAAT AGCGATAGGC TTGATATGCA AGAGGTGGCA 

SgctctSc actacaataa ccaatccttt gagttttata ttgaagataa agagggaagc 
gSc^tItg ccIcaccgaa tgccgataca tcaaatagtg ttaggcccga ctttctttat 
SggScaS gagatgataa tatttcgatt gttgctcaaa gcaaggcagg tgtgggattg 
cS?ISg ggSgacaat tcggggaatt gttatgattg cgataatggt tgtattcagc 

™JSJg CGTATATCTT TGCGCGGCAA ATGACAACGC CGATCAAAGC CTTAGCGGAC 

StSgaata aStggcaaa cctgaaagaa gtaccgccgc cgctggagcg aaaggatgag 

- r _ nv , rrrCAC TGGCTCACGA CATGCATTCC ATGTATATCA GGCTGAAAGA AACCATCGCA 

agSggaS SSSSSc aagggaacat gagttggagg aaacacagcg atatttcttt 
SggcagcS ctcatgagtt aaaaacgccc atcgcggctg taagcgttct gttggaggga 
SgIttSSI SSSotS ctacaaagac cattctaagt atctgcgcga atgcatcaaa 
atgItSaIa Scagggcaa aaccatttcc gaaatactgg agcttgtcag cctgaacgat 

GGGAGAATCG TACCCATAGC CGAACCGCTG GACATAGGGC GCACGGTTGC CGAGCTGCTA 
CCCGAOTC AAACCTTGGC AGAGGCAAAC AACCAGCGGT TCGTCACAGA TATTCCAGCC 

ggacaIISg t^tgtccga tccgaagctg atccaaaagg cgctatccaa tgtcatattg 
^atocIgSc agaacacgcc ccagggaggt gaggtacgga tatggagtga gcctggggct 

JESSES G^lcGT TTTGAACATG GGCGTTCACA TTGATGATAC TGCACTTTCA 
AAGCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG 

cSgggg cttgccatcg tacaaaaaac gctggatgcc atgagcctcc aatatgcgct 
ggaHIScc tcagatggcg ttttgttctg gctggattta ccgcccacat caacactata 

AATATTT 

EF085-4 (SEQ ID NO:324) 

OPVqYFRA MEAOOTVKSY QPLVELIQNS DRLDMQEVAG 

LFHYNNQSFE FYIEDKEGSV LYATPNADTS NSVRPDFLYV VHRDDNISIV AQSKAGVGLL 
MIAIMWFSL LCAYIFARQM TTPIKALADS ANKMANLKEV PPPLERKDEL 
GALAHDMHSM YIRLKETIAR LEDEIAREHE LEETQRYFFA AASHELKTPI AAVSVLLEGM 
LEN1GDYKDH SKYLRECIKM MDRQGKTISE ILELVSLNDG RIVPIAEPLD IGRTVAELLP 
DFQTLaS QRFvSlPAG QIVLSDPKLI QKALSNVILN AVQNTPQGGE VRIWSEPGAE 
KYrSS WIDDTALSK LFIPFYRIDQ ARSSKKWAKR FGACHRTKNA GCHEPPICAG 
KHLRWRFVLA GFTAHINTIN I 

EF086-1 (SEQ ID NO:325) 

TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT GATTTTATTA 
SSSSSS TATTGAATAC AGATGCAGAA AAATTAAATA AATTTACTGC ^CGCTGATG 
cStccaa AAGATCCAAA CATACAATGG CCAATTTATC GTGCAACAGG agctaactta 

aSgatISt cIStcaccgt tttaggtact ggacttttgt tagaagataa tcaacgccta 
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GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG TGATGGCTTA 
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT ACAATTGATG 
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AACCTTAATT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CAOAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT 
AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG AATGAACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA 
ATOTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAATCGAT CGCGC TTGTG ATTATTGGTC TTCTAGTTAT CGCCAGTGGG 
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF086-2 (SEQ ID NO:326) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP IYRATGANLT 
DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE 
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG RSISRAPETN 
PFTTEFESGK ETIANLTLIA KFAPENLRND IYTSIQTWLQ QSGSYYHFFK KPRDFEALID 
LKNWNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW 
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 
GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI LDNRMIHPQE 
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
TNTFAKISKN YGKTVENGTY EYLTWGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV 
MMNTWMNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF DKGILEWAA 
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE KDYTASSWKV 
YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
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EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAKAKL —X- ~HQK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVl 
LLVFRKSKSK K 

EF086-3 (SEQ ID NO: 327) 

ACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA^^C^GTG^CTT 

CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTI^ GCAATCTTTA 

= 5= =i = = = 

ESS S= — 5- 5SS = 

GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGG TGGCTCAAAT 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCACi 



AAT 

EF086-4 (SEQ ID NO:328) 



PENLRND IYTSIQTWLQ QSGSYYHFFK KPR ^" D FGNTENKKGW 

ssss ssss ssss =r BS 

EF087-1 (SEQ ID NO:329) 

TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT GATTTTATTA 
CATGATGACC TATTGAATAC AGATGCAGAA JAATTAAATA AATTTACTGC ^ 
CTGTATGCAA AAGATCCAAA ^ACAATGG CCAATTTATC ™J£J TCAACGCCTA 
ACAGATATTT CAATCACCGT TTT^OTACT ^^TTTTGT ^ ^gctta 
GTACAAGTAC AAGAAGCTGT JCCCTCCOTT WAAAAMTO TTACGGGAAT 
TATCCTGATG GTTCCTTGAT TCAACATGGT T ATTTTCCG1 * CTGGGAGATG 

~ = EE = Ss sss 

GTAAATGGAA AAATGCCATC GATGGTTTC1 TAGCTAATTT AACCTTAATT 

AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA ^ 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GA GA11J-1^ 

SSSSSa atgtagtgaa tagtgcgtca cctgcccaag c-acaccaat ocaatcttta 

AATGTATATG GTTCGATGGA TCGAGTCCTA ^AAAAATA ACGAATATGC taaaaaaqgc 

agtatgtatt cacaacgtgt cggaaactat gaatttggga atac GTTTG atgaa 

TGGCATACAG CAGACGGCAT GCTTTATTTA ^AATCAAG ACTTTGCT q ^ 
GGATACTGGG CAACGATCGA TCCATATCGA ^CCAGGAA CGACAG TGGCTCAAAT 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTA AATGAACTTA 
AATGGACAGG TTGCCTCTAT AGGAATGTTT ™ATAAAA ^JATGAAGG ^ 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA ™£l ^ TCATCCACAG 

sss see ses sss SEE? sss 
sss sss sss see ™ -s-s 

TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGALl^ii 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC ^^CCA 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GG^TTTATC JCTTAACTCT ^GAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTCaA «. 
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GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT 

mmmmm 
mmmmmm 

SESSS SSSSS ESSES S CG ™ 

TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 
EF087-2 {SEQ ID NO-.330) 

mmmmmm 

mmmmmm 
====== 

«hlS stovdqtctk qvkpssqggf rkasqflpst gekksialvi igllviasgc 



LLVFRKSKSK K 
EF087-3 (SEQ ID NO-.331) 



mmmmmm 

Tirr AATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT i 1 ^™** 

IaaggSaS ctcSc-Sga aaatacagca aacttacaag ccattgaagc aggtaattat 

EE S= SSSS =5= SSS5S EES 

TTACAAAATA ATGCATCC 



EF087-4 (SEQ ID NO:332) 
NRMIHPQE 



SoGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFWDKTY 

StfSiskn ygkweng^ eyltwgktn eeiaalsknk gytvlektan lqai^gnyv 

MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNAS 



EF088-1 (SEQ ID NO:333) 
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TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT GATTTTATTA 
CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA AATTTACTGC TCCGCTGATG 
CTGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC GTGCAACAGG AGCTAACTTA 
ACAGATATTT CAATCACCGT TTTAGGTACT GG AC TTTTGT TAGAAGATAA TCAACGCCTA 
GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG TGATGGCTTA 
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT ACAATTGATG 
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AACCTTAATT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT 
AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG AATGAACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT CGCCAGTGGG 
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF088-2 (SEQ ID NO:334) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP IYRATGANLT 

DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE 

LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG RSISRAPETN 

PFTTEFESGK ETIANLTLIA KFAPENLRND IYTSIQTWLQ QSGSYYHFFK KPRDFEALID 

LKNWNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW 

HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 

GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI LDNRMIHPQE 

VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
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TNTFAKISKN YGKTVENGTY EYLTWGKTN 
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG 
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT 
YSEALKQAQT VADQTTATQA EVDQAETELR 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF 
LLVFRKSKSK K 



EEIAALSKNK GYTVLENTAN LQAIEAGNYV 
VYRLTLANPL QNNASVSIEF DKGILEWAA 
PEVTKEALEK LIQEQKEHQE KDYTASSWKV 
SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
AEVDQAEAKL RSAVKRLTLK NSGENKKEQK 
RKASQFLPST GEKKSIALVI IGLLVIASGC 



EF088-3 (SEQ ID NO:335) 



A ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 

AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAA 



EF088-4 (SEQ ID NO:336) 

T PEVTKEALEK LIQEQKEHQE KDYTASSWKV 

YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK NSGENKKEQK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKK 



EF089-1 (SEQ ID NO:337) 



TGACAGATAC ACCTGCTAAC 
TATAGGTCAA AAATTTTTTG 
AATGACAGAC ATAGGAGAAT 
TGTATGTTAT TTGGCTGGAT 
ACACCAACAA TTCCCGAAAA 
GCGCCTGGTG CCAAACAAAC 
ACCATTGAAA ATACGGTGAA 
CAAAACGGGA TCAAACCTGA 
CCGAAAGAAA TCATCTTGCC 
CCTAAAGATT CTTTTGATGG 
GAAACAACGA CTTCTGCGGA 
GTTGTGGCTA TTATTCTTCA 
GGGGTTAAAC CAGGCCAAGT 
CAAGCGGCCT ATTTAAACCA 
CTTTACCAAT CCGATACTGA 
ATTTCTTTAA AAGGGGAACG 
GGTGTAAAAG ATGAAAAGGG 
CTGTACAAAT GGG AATTTAC 
AATGAAAAAG ACGTAACCAT 
ATCATTCTAG CGCTGCTCTT 
GAACAACAAT CTGAGCAATA 



ACAGGAAACT AAGAACGACA 
GCTTATCTTT CGGTCTTTTG 
GAATATGAAC AGATGGAAAG 
TGGCGTGGAG GCGCACGCTT 
TCAAGTGGAT AAATCAAAAA 
CGTAGAAATT CAGTTACGCA 
CTCAGCGACA ACAAATTTAA 
CAAAACCTTA CGTTTTAACT 
GAAGCATTCC CAAAAGACCT 
CGTGATGGCT GGCGGTATAA 
TCAATCAAAA GGGTTAGCTA 
GCAAAATGAG ACAAAGGTTC 
CAACGCGCGA AACGTCATCA 
ATTACATTTA ■ ATC AACACTG 
GGATATGCAA GTGGCGCCAA 
ATTAACGCCA GGAAAATATG 
CACCTATCAA GTCAAAGGCG 
AAAAGAATTT ACTATTTCTG 
TAAAGGAACC AATTGGTGGT 
ATTGATTTTC TTCTTGTATC 
A 



GCATACACGC AAGATCGGGA 
GTGCTTATAA TACAACAAAG 
TATATGCAAC GGTAATCGCT 
CTGAATTTAA TTTTGCGGTC 
CCTACTTTGA CTTAAAAATG 
ATGATACAGA TGAAGACATT 
ATGGCGTAGT AGAATATGGC 
TAAAAGATTA TGTGGAAGCA 
TACCTTTAAC CATTACGATG 
CACTCAAAGA GAAAAAGAAA 
TTAATAATGA ATACTCCTAT 
AACCAGATTT AAAATTACTG 
ATGTTTCTTT ACAAAACCCA 
TTTCAAAAGG AGGCGAAACG 
ACTCTAACTT TAGTTACCCA 
TCTTGAAATC AACGGCCTAT 
CCAATGGTGA AGAACGGTAC 
GGGACGTCGC TAAAGAATTA 
TGTATCTACT GATTGCATTA 
GTAAAAAGAA AAAAGAGGAA 



EF089-2 (SEQ ID NO:338) 
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MNR WKVYATVIAC 

MLFGWIGVEA HASEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT 
IENTVNSATT NLNGWEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTITMP 
KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAIILQQNET KVQPDLKLLG 
VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI 
SLKGERLTPG KYVLKSTAYG VKDEKGTYQV KGANGEERYL YKWEFTKEFT I SGDVAKELN 
EKDVTIKGTN WWLYLLIALI ILALLLLIFF LYRKKKKEEE QQSEQ 

EF089-3 (SEQ ID NO:339) 
T CTGAATTTAA TTTTGCGGTC 

ACACCAACAA TTCCCGAAAA TCAAGTGGAT AAATCAAAAA CCTACTTTGA CTTAAAAATG 
GCGCCTGGTG CCAAACAAAC CGTAGAAATT CAGTTACGCA ATGATACAGA TGAAGACATT 
ACCATTGAAA ATACGGTGAA CTCAGCGACA ACAAATTTAA ATGGCGTAGT AGAATATGGC 
CAAAACGGGA TCAAACCTGA CAAAACCTTA CGTTTTAACT TAAAAGATTA TGTGGAAGCA 
CCGAAAGAAA TCATCTTGCC GAAGCATTCC CAAAAGACCT TACCTTTAAC CATTACGATG 
CCTAAAGATT CTTTTGATGG CGTGATGGCT GGCGGTATAA CACTCAAAGA GAAAAAGAAA 
GAAACAACGA CTTCTGCGGA TCAATCAAAA GGGTTAGCTA TTAATAATGA ATACTCCTAT 
GTTGTGGCTA TTATTCTTCA GCAAAATGAG ACAAAGGTTC AACCAGATTT AAAATTACTG 
GGGGTTAAAC CAGGCCAAGT CAACGCGCGA AACGTCATCA ATGTTTCTTT ACAAAACCCA 
CAAGCGGCCT ATTTAAACCA ATTACATTTA ATCAACACTG TTTCAAAAGG AGGCGAAACG 
CTTTACCAAT CCGATACTGA GGATATGCAA GTGGCGCCAA ACTCTAACTT TAGTTACCCA 
ATTTCTTTAA AAGGGGAACG AT 

EF089-4 (SEQ ID NO: 340) 

SEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT 

IENTVNSATT NLNGWEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTITMP 

KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAIILQQNET KVQPDLKLLG 

VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI 

SLKGER 

EF090-1 (SEQ ID NO:341) 

TAGTCTCTAA GAAATAAACC TAAAATTATT GATATAAAGG ATGAACAAAT GAAAAAAGAA 
GAAATGCAAA TGCGTAATAC ACGTCGTCAA AAATCAGGAA AAAATAATAA AAAGAAAGTA 
ATTATTACTT CTTTGGTTGG ACTAGCTCTG GTTGCTGGGG GCAGTTATGT TTATTTTCAA 
AGTCACTTTT TNCCAACCAC AAAAGTAAAT GGAGTTTCTG TAGGCTGGTT AAATGTAAAT 
GCTGCAGAAG AAAAATTAGC GCAAGTTAAT CAAACCGAAG AAGTTGTGGT TCAAACGGGG 
ACAAAAGAAG AAAAAATTCA ACTTCCTAAA AAATACCAAT TGGATCAAAA ATTTTTAAAA 
GACCATTTAC ACAGTAGCAA GGTGAAGCTA CCGTTAAACG AGGCATTCAA AAAAGAACTA 
GAAGCCAAAT TAGCAACTTT GAGTTTTCCA GAGGGGAAAC CAAGCAAAAA TGCGAGTATC 
CGTCGAGGCA ATGGCACTTT TGAAATTGTT CCCGAAGAAC AAGGCACAGT AGTGGACACA 
CAGCGCTTAA ACCAGCAGAT TATTGCGGAT GTTGAAGCGG GAAAAGGCAA CTATCAATAT 
AATGCCAAAG ATTTTTATAA AGCCCCTGAA ATTACAAAAG AGGATCAAAC GTTAAAGGCA 
ACATTGACAA CGCTCAATAA CAAGTTAAAT AAAACAATTA CAGTTGATAT TAATGGTGAA 
AAAGTAGCCT TTGATAAAAC ACAAATTCAA AACGTGCTGA ATGATGATGG CACAATCAAC 
AAAGAAAAAC TAACTACTTG GGTGACACAA TTAGAAACAA CATATGGTTC TGCTAATCAA 
CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTCGTT TTAAAAACAA CGGAAGTTAT 
GGCTGGTCGA TTGATGGGGC CAAAACGCAA GAACTACTAG TAAACGCGCT GAATAGCCAA 
GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTGGTGATA CCAAAGAAAA TAGTAAAATT 
GCCAATAATT ACATTGAAAT TGATTTAAAA GATCAAAAAA TGTATTGTTT CATTGATGGC 
AAAAAAATAG TCACCACAGA TGTCATTACT GGCAGATATA ACAAAGGAAC CGCAACAGTA 



WO 98/50554 



PCT/US98/08959 



183 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 

CCAGGATTCC ATACAATTTT ATATCGGACA ACCGATGTGA ATTTAGAAGG TCAAATGCTT 
GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGGATGC CGTTATTAAG TCAAGGGGGC 
GTTGTCACAC AAATCGGGAT TCATGACTCC GACCATAAAT TGGATAAGTA TGGCGATAAA 
GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGCTGTATCA ATACGCCAGG AACAGAAGTT 
TCAAAAATCT TTGATGTATC CTATGACGGA ATGCCGGTAA TTATTTATGG ACATATCTAT 
GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGT ATAA 



EF090-2 (SEQ ID NO:342) 

MRNTRRQK SGKNNKKKVI ITSLVGLALV AGGSYVYFQS 

HFXPTTKVNG VSVGWLNVNA AEEKLAQVNQ TEEWVQTGT KEEKIQLPKK YQLDQKFLKD 
HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTWDTQ 
RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK 
VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG 
WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK 
KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGGV 
VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 
DAPGEFDKPV DYGEEV 

EF090-3 (SEQ ID NO:343) 

CAC AAAAGTAAAT GGAGTTTCTG TAGGCTGGTT AAATGTAAAT 

GCTGCAGAAG AAAAATTAGC GCAAGTTAAT CAAACCGAAG AAGTTGTGGT TCAAACGGGG 
ACAAAAGAAG AAAAAATTCA ACTTCCTAAA AAATACCAAT TGGATCAAAA ATTTTTAAAA 
GACCATTTAC ACAGTAGCAA GGTGAAGCTA CCGTTAAACG AGGCATTCAA AAAAGAACTA 
GAAGCCAAAT TAGCAACTTT GAGTTTTCCA GAGGGGAAAC CAAGCAAAAA TGCGAGTATC 
CGTCGAGGCA ATGGCACTTT TGAAATTGTT CCCGAAGAAC AAGGCACAGT AGTGGACACA 
CAGCGCTTAA ACCAGCAGAT TATTGCGGAT GTTGAAGCGG GAAAAGGCAA CTATCAATAT 
AATGCCAAAG ATTTTTATAA AGCCCCTGAA ATTACAAAAG AGGATCAAAC GTTAAAGGCA 
ACATTGACAA CGCTCAATAA CAAGTTAAAT AAAACAATTA CAGTTGATAT TAATGGTGAA 
AAAGTAGCCT TTGATAAAAC ACAAATTCAA AACGTGCTGA ATGATGATGG CACAATCAAC 
AAAGAAAAAC TAACTACTTG GGTGACACAA TTAGAAACAA CATATGGTTC TGCTAATCAA 
CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTCGTT TTAAAAACAA CGGAAGTTAT 
GGCTGGTCGA TTGATGGGGC CAAAACGCAA GAACTACTAG TAAACGCGCT GAATAGCCAA 
GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTGGTGATA CCAAAGAAAA TAGTAAAATT 
GCCAATAATT ACATTGAAAT TGATTTAAAA GATCAAAAAA TGTATTGTTT CATTGATGGC 
AAAAAAATAG TCACCACAGA TGTCATTACT GGCAGATATA ACAAAGGAAC CGCAACAGTA 
CCAGGATTCC ATACAATTTT ATATCGGACA ACCGATGTGA ATTTAGAAGG TCAAATGCTT 
GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGGATGC CGTTATTAAG TCAAGGGGGC 
GTTGTCACAC AAATCGGGAT TCATGACTCC GACCATAAAT TGGATAAGTA TGGCGATAAA 
GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGCTGTATCA ATACGCCAGG AACAGAAGTT 
TCAAAAATCT TTGATGTATC CTATGACGGA ATGCCGGTAA TTATTTATGG ACATATCTAT 
GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGT AT 

EF090-4 (SEQ ID NO:344) 

TKVNG VSVGWLNVNA AEEKLAQVNQ TEEWVQTGT KEEKIQLPKK YQLDQKFLKD 
HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTWDTQ 
RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK 
VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG 
WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK 
KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGGV 
VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 
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DAPGEFDKPV DYGEEV 
EF091-1 (SEQ ID NO:345) 

TAATTGGNGG AGATTTTTAT GGCTAAAAAA GGCGGATTTT TCTTAGGNGC AGTAATTGGT 
GGAACAGCAG CAGCCGTTGC CGCATTATTA CTTGCACCAA AATCAGGTAA AGAATTACGT 
GATGATTTAT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 
GCTGTTCAAA AAGGAACAGA ATTAACAGAA ATCGCAAAAC AAAAAGCCGG CGTTTTATCA 
GATCAAGCCT CTGATTTGGC AGGTTCTGTC AAAGAAAAAA CAAAAGATTC ATTGGATAAA 
GCACAAGGTG TTTCTGGCGA CATGCTTGAT AACTTTAAAA AACAAACAGG TGATTTATCT 
GATCAATTTA AAAAAGCAGC TGACGATGCT CAAGATCACG CAGAAGATTT AGGTGAAATT 
GCCGAAGATG CAGCAGAAGA TATCTATATT GACGTTAAAG ATTCTGCGGC AGCGGCCAAA 
GAAACTGTTT CTGCTGGTGT CGATGAAGCA ANAGAAACCA CCAAAGATGT TCCTGAAAAA 
GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT GCAGCGAAAG ACGTAAAAAA AGAATTTAAA 
GGGTAA 

EF091-2 (SEQ ID NO:346) 

MAKKG GFFLGAVIGG TAAAVAALLL APKSGKELRD DLSNQTDDLK NKAQDYTDYA 
VQKGTELTEI AKQKAGVLSD QASDLAGSVK EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA 
AEAKEDVKDA AKDVKKEFKG 

EF091-3 (SEQ ID NO:347) 

AT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 
GCTGTTCAAA AAGGAACAGA ATTAACAGAA ATCGCAAAAC AAAAAGCCGG CGTTTTATCA 
GATCAAGCCT CTGATTTGGC AGGTTCTGTC AAAGAAAAAA CAAAAGATTC ATTGGATAAA 
GCACAAGGTG TTTCTGGCGA CATGCTTGAT AACTTTAAAA AACAAACAGG TGATTTATCT 
GATCAATTTA AAAAAGCAGC TGACGATGCT CAAGATCACG CAGAAGATTT AGGTGAAATT 
GCCGAAGATG CAGCAGAAGA TATCTATATT GACGTTAAAG ATTCTGCGGC AGCGGCCAAA 
GAAACTGTTT CTGCTGGTGT CGATGAAGCA ANAGAAACCA CCAAAGATGT TCCTGAAAAA 
GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT GCAGCGAAAG ACGTAAAAAA AGAATTTAAA 
GGGTAA 



EF091-4 (SEQ ID NO:348) 

SNQTDDLK NKAQDYTDYA 
VQKGTELTEI AKQKAGVLSD QASDLAGSVK 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID 
AEAKEDVKDA AKDVKKEFKG 

EF092-1 (SEQ ID NO:349) 



EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD 
VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA 



TAAGGGGATG AAGAAAAAAT GGCAAAAAAA ACAATTATGT TAGTTTGTTC CGCAGGAATG 
AGCACGAGTT TATTAGTAAC AAAAATGCAA AAAGCAGCAG AAGATCGTGG CATGGAAGCA 
GACATCTTTG CAGTATCGGC TTCTGAAGCA GATACAAACT TGGAAAATAA AGAGGTGAAT 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC ATGAAAGGGC AATTTGAACA AAAATTACAA 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC ATGGCAGATT ATGGCATGAT GAATGGCGAA 
AAAGTTTTAG ATCAAGCAAT CTCATTAATG GGATAA 

EF092-2 (SEQ ID NO:350) 

MAKKT IMLVCSAGMS TSLLVTKMQK AAEDRGMEAD IFAVSASEAD TNLENKEVNV 
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LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK VLDQAISLMG 
EF092-3 (SEQ ID NO:351)' 
AG AAGATCGTGG CATGGAAGCA 

GACATCTTTG CAGTATCGGC TTCTGAAGCA GATACAAACT TGGAAAATAA AGAGGTGAAT 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC ATGAAAGGGC AATTTGAACA AAAATTACAA 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC ATGGCAGATT ATGGCATGAT GAATGGCGAA 
AAAGTTTTAG ATCAAGCAAT CTCATTAATG GGAT 

EF092-4 (SEQ ID NO:352) 

EDRGMEAD IFAVSASEAD TNLENKEVNV 

LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK VLDQAISLMG 
EF093-1 (SEQ ID NO:353) 

TAGTTTTTTT CCGATAAAGG GAGAATTTTA ATGAGGCAAA AATATTCAGG AAACTTATTG 
TTCACGGCCA TGGCCATTGT TTATTTGATG AGTTTTCTCG CCCTTCAGTT ACTAGAAGAA 
CGTCAGTTAA CACAAAAATT TACGCAAGCT ACCCAGGAAT ACTATGCAGG GAAAAGTATC 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAGACGAA AGTTAAAAAC AGAAGAAAGG 
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACATACAAAA ATGAACAATT AAGAATAACT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAATATCAAG AGAGAGTTTC TCATCAAAAA 
AAAGCGGAAA CAATACTGGA ATAG 

EF093-2 (SEQ ID NO:354) 

M RQKYSGNLLF TAMAIVYLMS FLALQLLEER QLTQKFTQAT QEYYAGKSIF 
HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK 
AETILE 

EF093-3 (SEQ ID NO:355) 
CCTTCAGTT ACTAGAAGAA 

CGTCAGTTAA CACAAAAATT TACGCAAGCT ACCCAGGAAT ACTATGCAGG GAAAAGTATC 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAGACGAA AGTTAAAAAC AGAAGAAAGG 
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACATACAAAA ATGAACAATT AAGAATAACT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAATATCAAG AGAGAGTTTC TCATCAAAAA 
AAAGCGGAAA CAATACTGG 

EF093-4 (SEQ ID NO:356) 

LQLLEER QLTQKFTQAT QEYYAGKSIF 

HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK 
AETI- 

EF094-1 (SEQ ID NO:357) 

TAAACATTTG AGACATTCAG AGGTGAATGT CTCTTTTTTA TTACTCAAAA ACGAAAGGGG 
ATTAATTATA TGAAAAAAAC AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA 
TTAAGTCAAA CAATTGGCGG AACGATTGGT CCTACGATTG CTTTTGCCGA TGAAATTACT 
CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
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GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AAGTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCG AGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA' AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG 
ACTTTGGCTT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA CATTCCCGCA AAC TGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC 
TTTATCTTGA TTTTTTCGAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA 

EF094-2 (SEQ ID NO:358) 

MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 
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YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE I^STFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YWAQAIDVE ATKAAQEKDE KAKPWIAET 
TATLANKEKT GTWKILHKLT AEQVLDKS IV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF 
ILIFSTAGYY FWNRRN 

EF094-3 (SEQ ID NO:359) 

CGA TGAAATTACT 

CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GGAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAG 

EF094-4 (SEQ ID NO:360) 

DEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 

YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLGGASVDIK 

SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ NTANIDYRVI 

GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
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SLKIKKIDKE SGDIVPETVF HLDFGKALPS 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK 
RKDSPAGEIV QEITTDEKGR AETPKELANA 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK 
SEAFKTELVK GTKASDET 



KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 



EF095-1 (SEQ ID NO:361) 



TAAGAATTGT TGGATTGTTC TTTAGAAAGA 
GAATTGATAG TAACGGGCAT CTGCCATATA 
GTTTTTGCAG AAACATTACC AAGTACAAAA 
ACAGCAGAAA AAGCCGAAAG TGAACAACCA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC 
ATTAGAGAAA GAATTGAGAC GCCTAACCTA 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT 
CCAGATGGCA TAAATGTGTG GGAAGGTGAA 
AATTTAAAAG AAGTGGTAAT TCCAAGTGAG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT 
TACAATAAGA AAGGGGAAAT TGATCCCAAT 
GGAAACCAAT ATCCAACAAC AATTTCGCAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC 
TTGTACAACC AAGTGAAGGT TGATTCATCG 
TTTTCAGGGC CGGTTTATTA TCATGTTACC 
ACTCAAGGGA AACCAATCCC TCCACCACCG 
GAGCGTGACC CTTACACCTT TAAACAGAAA 
TCAAAAACGT ATCAATTTCA AGGATGGTAT 
AAAAGCGTAA CGCCCAGTTA TGATATTACC 
TATAAGGAGA TACCTCAAAA AAATTATACA 
CCACCATCTG ATTTTATTCA GGATCACCAA 
TTAGCTGGAA AAAAACTGCC ACAACAATAG 
GGTTGGTATC AAGATAAAAC NAAACAAGAG 
TCCCCTGTTT TTAATGAAAT GAACGCTATT 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA 
CAGATTATGC TTACAAATGT GGGAGAAGTA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT 
GATCAAGTGT TACAAGCGGC TGTTGAAATG 
GATACTGTCA GAATCCAACC TAAAAATCAA 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA 
AAAAAATCAC AACCCAATTG GGCACTAACT 
GATCAACTAT CATCAATGAC AAAGTTATTG 
CAGTACAATC AACCAACGGA AACTAAAGTT 
GTTGCCAACG GTGTAGCTAG CCATATTGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC 
AAAGATCAAA CTTATCAAGC AATGGTGACT 



AGGGACAATA TGAAGCGAAG TAAATGGAAA 
TTAGTATTCC CCATACTAAT ACAGACAACT 
CAAGTAAGAG AAGGAACCAA TCATTCATTA 
CAGACAAAGG ATAAACTACA TGATGAAGAA 
GATAATGAGG CTAATGTTAC AAGTCAAACG 
ACTTATCGTT ATGGATTTAT TAATGAAGAG 
CTACAGTATC ATAGTTGGCA AGGCAATTCC 
AGTCAACCAG TGACAGCATC TACAGTGGCT 
AAAGTAGCCG TCTATTCCGA CATGTCAACG 
TTACCAAGAT ATTATACTTC TTTAAGCTTA 
TATCCGCTGC CAACTATTTC CGACGCATCA 
TTTGAATTGG AAAAAATGTC TGCACAACAA 
ATTAGCGAGA GTCAAAAACT AATCGTTCCT 
AATCAATCTG GGCTATTGAA TTACTTTAAA 
AATCGCAAAG TGACAGAACA TTTTGTGGAT 
GGGTTTAGAC AAGGAAAGCA AACACTTATT 
GATCTTTTGC CAAGTAGCTA TGAAATTGAC 
AAAGGGAAAA CGAAACCTGA AAATTTAGAA 
TATGACGACA ATGATGATTT AACTGTTGTC 
TTTGAGGATG TCAATGGTGT TGAAATTGCA 
CAACCAATAA CTACGGATGG CTTTCGCTAT 
AGCGTTAACG GTAAAACTTA TTTATATCAA 
AGCTTAGAAA AAACGAAGCG ACCCATAAAC 
ACAGCAGTGT ATAAGGAAAT AACTGCAAAA 
GTCATGCCAA GTGGTTATAT ACAAATTTGG 
CCGTTAAAAA AAATAAACTT AAAGCCAGCA 
ATCCAAGTCA CGATTCGTGT TGGATCTGAA 
AATTGGCGAG TTGGCATTAC TTTAAATACG 
ATGATGACAA CAATTGCTAC AGGTGAACCA 
AATGGAAATT TTTCTGCTGT TCACGCAGCT 
GAAATTGTGG CACCAGATGA GGAAGGTTTT 
GTCGCCATTT CTAGCAACAC GCAGCAACAT 
AATGGTCAGG AAAATCCATA TTTACGTTTG 
GCAGAACTAT CCCCCTTTGA AGGAAGAGTG 
TTAGGAACAA CCAATGTTTC AGGTTTTATT 
GCTCTTGGCA AAACAACCGC TATTCAATTA 
GCCAATGGTC AGTTTGACGA AAGTGATGTT 
AAATTAGAAA TTCCAGCAAA TCAAGGTAGA 
TGGAATTTAG TGACAGGCCC ATAA 



EF095-2 (SEQ ID NO:362) 

MKRSKWKE LIVTGICHIL VFPILIQTTV FAETLPSTKQ VREGTNHSLT 

AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFINEEG 

QPVNANEILL QYHSWQGNSP DGINVWEGES QPVTASTVAN LKEWIPSEK VAVYSDMSTV 
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LAASNQTFFL PRYYTSLSLY NKKGEIDPNY PLPTISDASG NQYPTTISQF ELEKMSAQQY 
SQKTGVTFNI SESQKLIVPL YNQVKVDSSN QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT 
QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD LLPSSYEIDS KTYQFQGWYK GKTKPENLEK 
SVTPSYDITY DDNDDLTWY KEIPQKNYTF EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL 
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES LEKTKRPINS PVFNEMNAIT AVYKEITAKA 
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKINLKPAS GWSPGLARPI QVTIRVGSEP 
NKIVPITDEN WRVGITLNTE VPIGQTATIM MTTIATGEPD QVLQAAVEMN GNFSAVHAAD 
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AISSNTQQHG LKQAADYYEN GQENPYLRLK 
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV 
ANGVASHIVA NGQFDESDVY QFDFSFDQIK LEIPANQGRK DQTYQAMVTW NLVTGP 

EF095-3 (SEQ ID NO:363) 

AAGTACAAAA CAAGTAAGAG AAGGAACCAA TCATTCATTA 

ACAGCAGAAA AAGCCGAAAG TGAACAACCA CAGACAAAGG ATAAACTACA TGATGAAGAA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC GATAATGAGG CTAATGTTAC AAGTCAAACG 
ATTAGAGAAA GAATTGAGAC GCCTAACCTA ACTTATCGTT ATGGATTTAT TAATGAAGAG 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT CTACAGTATC ATAGTTGGCA AGGCAATTCC 
CCAGATGGCA TAAATGTGTG GGAAGGTGAA AGTCAACCAG TGACAGCATC TACAGTGGCT 
AATTTAAAAG AAGTGGTAAT TCCAAGTGAG AAAGTAGCCG TCTATTCCGA CATGTCAACG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT TTACCAAGAT ATTATACTTC TTTAAGCTTA 
TACAATAAGA AAGGGGAAAT TGATCCCAAT TATCCGCTGC CAACTATTTC CGACGCATCA 
GGAAACCAAT ATCCAACAAC AATTTCGCAA TTTGAATTGG AAAAAATGTC TGCACAACAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC ATTAGCGAGA GTCAAAAACT AATCGTTCCT 
TTGTACAACC AAGTGAAGGT TGATTCATCG AATCAATCTG GGCTATTGAA TTACTTTAAA 
TTTTCAGGGC CGGTTTATTA TCATGTTACC AATCGCAAAG TGACAGAACA TTTTGTGGAT 
ACTCAAGGGA AACCAATCCC TCCACCACCG GGGTTTAGAC AAGGAAAGCA AACACTTATT 
GAGCGTGACC CTTACACCTT TAAACAGAAA GATCTTTTGC CAAGTAGCTA TGAAATTGAC 
TCAAAAACGT ATCAATTTCA AGGATGGTAT AAAGGGAAAA CGAAACCTGA AAATTTAGAA 
AAAAGCGTAA CGCCCAGTTA TGATATTACC TATGACGACA ATGATGATTT AACTGTTGTC 
TATAAGGAGA TACCTCAAAA AAATTATACA TTTGAGGATG TCAATGGTGT TGAAATTGCA 
CCACCATCTG ATTTTATTCA GGATCACCAA CAACCAATAA CTACGGATGG CTTTCGCTAT 
TTAGCTGGAA AAAAACTGCC ACAACAATAC AGCGTTAACG GTAAAACTTA TTTATATCAA 
GGTTGGTATC AAGATAAAAC NAAACAAGAG AGCTTAGAAA AAACGAAGCG ACCCATAAAC 
TCCCCTGTTT TTAATGAAAT GAACGCTATT ACAGCAGTGT ATAAGGAAAT AACTGCAAAA 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA GTCATGCCAA GTGGTTATAT ACAAATTTGG 
CAGATTATGC TTACAAATGT GGGAGAAGTA CCGTTAAAAA AAATAAACTT AAAGCCAGCA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA ATCCAAGTCA CGATTCGTGT TGGATCTGAA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA AATTGGCGAG TTGGCATTAC TTTAAATACG 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT ATGATGACAA CAATTGCTAC AGGTGAACCA 
GATCAAGTGT TACAAGCGGC TGTTGAAATG AATGGAAATT TTTCTGCTGT TCACGCAGCT 
GATACTGTCA GAATCCAACC TAAAAATCAA GAAATTGTGG CACCAGATGA GGAAGGTTTT 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA GTCGCCATTT CTAGCAACAC GCAGCAACAT 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA AATGGTCAGG AAAATCCATA TTTACGTTTG 
AAAAAATCAC AACCCAATTG GGCACTAACT GCAGAACTAT CCCCCTTTGA AGGAAGAGTG 
GATCAACTAT CATCAATGAC AAAGTTATTG TTAGGAACAA CCAATGTTTC AGGTTTTATT 
CAGTACAATC AACCAACGGA AACTAAAGTT GCTCTTGGCA AAACAACCGC TATTCAATTA 
GTTGCCAACG GTGTAGCTAG CCATATTGTT GCCAATGGTC AGTTTGACGA AAGTGATGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC AAATTAGAAA TTCCAGCAAA TCAAGGTAGA 
AAAGATCAAA CTTATCAAGC AATGGTGACT TGGAATTTAG TGACAGGCCC A 

EF095-4 (SEQ ID NO:364) 

STKQ VREGTNHSLT 
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AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFINEEG 
QPVNANEILL QYHSWQGNSP DGINVWEGES QPVTASTVAN LKEWIPSEK VAVYSDMSTV 
LAASNQTFFL PRYYTSLSLY NKKGEIDPNY PLPTISDASG NQYPTTISQF ELEKMSAQQY 
SQKTGVTFNI SESQKLIVPL YNQVKVDSSN QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT 
QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD LLPSSYEIDS KTYQFQGWYK GKTKPENLEK 
SVTPSYDITY DDNDDLTWY KEIPQKNYTF EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL 
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES LEKTKRPINS PVFNEMNAIT AVYKEITAKA 
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKINLKPAS GWSPGLARPI QVTIRVGSEP 
NKIVPITDEN WRVGITLNTE VPIGQTATIM MTTIATGEPD QVLQAAVEMN GNFSAVHAAD 
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AISSNTQQHG LKQAADYYEN GQENPYLRLK 
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV 
ANGVASHIVA NGQFDESDVY QFDFSFDQIK LEIPANQGRK DQTYQAMVTW NLVTGP 

EF096-1 (SEQ ID NO:365) 

TGAGGTGGCC AAGTTAAAAT GAAAAAATTA CAGTCACTTT TTATTGGAAT TATCGCTATT 

ATTGTCATCT TGTTTTTTGG CGTGCGCCAA TTGGAGAAAG CAAGTGGCAT GGCAGGAGCA 

GATACCTTGA CCATTTACAA TTGGGGGGAC TATATAGATC CGGCCTTGAT TAAGAAATTT 

GAAAAAGAAA CAGGCTATAA AGTCAATTAC GAAACCTTTG ATTCTAATGA AGCTATGTAT 

ACAAAAATTC AGCAAGGTGG CACAGCCTAT GATATTGCCA TTCCTTCTGA ATATATGATT 

CAAAAAATGA TGAAAGCGAA GATGCTTTTA CCACTTGATC ACAGCAAATT AAAAGGCTTA 

GAAAACATTG ATGCACGCTT TTTAGATCAA TCCTTTGATC CCAAAAATAA GTTTTCCGTT 

CCGTACTTCT GGGGCACGTT GGGGATTATT TATAATGATA AATTTATTGA CGGCCGTCAG 

ATCCAACATT GGGATGATTT ATGGCGCCCG GAATTAAAAA ATAATGTCAT GCTGATTGAT 

GGCGCTCGCG AAGTGTTAGG ATTATCTTTG AACAGTTTAG GCTATTCGTT AAACAGTAAA 

AACGACCAAC AATTACGTCA GGCTACCGAT AAGTTAAACC GATTAACGAA CAATGTCAAA 

GCAATTGTTG CCGATGAAAT CAAAATGTAC ATGGCTAATG AAGAAAGTGC AGTTGCTGTA 

ACTTTCTCTG GTGAAGCTGC TGAAATGCTA GAAAACAATG AACATCTACA TTATGTGATT 

CCCAGTGAAG GCTCTAATCT CTGGTTTGAT AACATTGTGA TGCCTAAGAC AGCCAAAAAT 

AAAGAGGGTG CCTATGCATT TATGAACTTT ATGTTACGAC CAGAAAATGC GGCACAAAAT 

GCAGAATATA TTGGTTATTC CACACCAAAT AAAGAAGCTA AAAAACTATT ACCAAAAGAA 

GTTGCCGAAG ATAAACAATT TTATCCAGAT GATGAAACTA TCAAACATTT AGAAGTTTAC 

CAAGACTTAG GTCAAGAATA CTTAGGAATT TATAACGATC TGTTCTTGGA GTTTAAGATG 
TATCGGAAAT AA 

EF096-2 (SEQ ID NO:366) 

MKKLQ SLFIGIIAII VILFFGVRQL EKASGMAGAD TLTIYNWGDY IDPALIKKFE 
KETGYKVNYE TFDSNEAMYT KIQQGGTAYD IAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE 
NIDARFLDQS FDPKNKFSVP YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK LNRLTNNVKA IVADEIKMYM ANEESAVAVT 
FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN IVMPKTAKNK EGAYAFMNFM LRPENAAQNA 
EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD ETIKHLEVYQ DLGQEYLG I Y NDLFLEFKMY 
RK 



EF096-3 ( SEQ ID NO:367) 

AAGTGGCAT GGCAGGAGCA 
GATACCTTGA CCATTTACAA TTGGGGGGAC 
GAAAAAGAAA CAGGCTATAA AGTCAATTAC 
ACAAAAATTC AGCAAGGTGG CACAGCCTAT 
CAAAAAATGA TGAAAGCGAA GATGCTTTTA 
GAAAACATTG ATGCACGCTT TTTAGATCAA 
CCGTACTTCT GGGGCACGTT GGGGATTATT 



TATATAGATC CGGCCTTGAT TAAGAAATTT 
GAAACCTTTG ATTCTAATGA AGCTATGTAT 
GATATTGCCA TTCCTTCTGA ATATATGATT 
CCACTTGATC ACAGCAAATT AAAAGGCTTA 
TCCTTTGATC CCAAAAATAA GTTTTCCGTT 
TATAATGATA AATTTATTGA CGGCCGTCAG 
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ATCCAACATT GGGATGATTT ATGGCGCCCG GAATTAAAAA ATAATGTCAT GCTGATTGAT 
GGCGCTCGCG AAGTGTTAGG ATTATCTTTG AACAGTTTAG GCTATTCGTT AAACAGTAAA 
AACGACCAAC AATTACGTCA GGCTACCGAT AAGTTAAACC GATTAACGAA CAATGTCAAA 
GCAATTGTTG CCGATGAAAT CAAAATGTAC ATGGCTAATG AAGAAAGTGC AGTTGCTGTA 
ACTTTCTCTG GTGAAGCTGC TGAAATGCTA GAAAACAATG AACATCTACA TTATGTGATT 
CCCAGTGAAG GCTCTAATCT CTGGTTTGAT AACATTGTGA TGCCTAAGAC AGCCAAAAAT 
AAAGAGGGTG CCTATGCATT TATGAACTTT ATGTTACGAC CAGAAAATGC GGCACAAAAT 
GCAGAATATA TTGGTTATTC CACACCAAAT AAAGAAGCTA AAAAACTATT ACCAAAAGAA 
GTTGCCGAAG ATAAACAATT TTATCCAGAT GATGAAACTA TCAAACATTT AGAAGTTTAC 
CAAGACTTAG GTCAAGAATA CTTAGGAATT TATAACGATC TGTTCTTGGA GTTTAAGATG 
TATCGGAAA 

EF096-4 (SEQ ID NO:368) 
SGMAGAD TLTIYNWGDY IDPALIKKFE 

KETGYKVNYE TFDSNEAMYT KIQQGGTAYD IAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE 
NIDARFLDQS FDPKNKFSVP YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK LNRLTNNVKA IVADEIKMYM ANEESAVAVT 
FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN IVMPKTAKNK EGAYAFMNFM LRPENAAQNA 
EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD ETIKHLEVYQ DLGQEYLGIY NDLFLEFKMY 
RK 

EF097-1 (SEQ ID NO:369) 

TAGAAGTATT CTAATTATCT ACATAGAGAG CGAGGGACAA GGAATATGAA GGAAAAAGAA 
ATGCATTCGC TCTTTTTTAA ACATAAGTTT GTGAAAGTAA CTCCCTATTT ACGTCGTTTT 
GGTCATCGTT TGAGTGGGAT GATTATGCCA AATTTGAGTA TTTTTATTGC GTGGAGCTTA 
TTGTCTTTGG TGGCTGGCTA TACGACTGGG AATCTACGGC TAGCTCTTTC TGAAGTCGAA 
ACGATAATGA TTCGAGTTGT TTTACCGATT CTAATTGGTT TTACAGGCGG AAAAATGTTC 
GAGGAACAAC GTGGCGGCGT TGTTGCTGCT ATTGCGACAG TGGGCGTGAT TGTTTCCACA 
GATGTTCCAC AGTTGTTTGG TGCTATGTTT ATTGGCCCTT TAGCAGGATA TACTTTCGCC 
AAAATTGAAC AAATTCTCTT ACCGAAAGTT AAAGAAGGCT ACGAGATGCT GACTAAAAAC 
TTTTTAGCAG GAATTGTGGG AGGACTGCTG TGCTGTTTTG GTATTCTGGT TGTAGCTCCG 
GCTGTTGAAA GCGCTAGTTT TTGGCTGTAT CAATTTTCTT CTTGGTTAAT TGAAGCCAAT 
CTTTTACCAT TGGTTCACGT TTTCTTAGAG CCCTTAAAAG TGTTATTTTT TAATAATGCG 
ATTAACCATG GCTTATTAAC GCCTCTAGGT TTAGAAGGTG CTAGTCAAAC AGGTCAGTCC 
ATTTTATTTC TATTGGAAAC AAACCCTGGA CCAGGCGTGG GCGTTTTGGT TGCTTTTCTG 
CTGTTTGGGC CTGTAGGACA ACGAAAAACA GCAGGAGGTG CCACCATGAT TCAACTGATT 
GGGGGCATTC ATGAAATTTA TTTTCCGTTT GTTTTGATGG ACCCGCGCTT ATTTTTAGCA 
GTAATTGCTG GAGGAATGAG TGGTACGCTT GTTTTTCAAA TATTTAATGT GGGTCTAAGT 
GCTCCAGCTT CGCCAGGTTC ATTGGTTGCG ATTTTAGCCA ATGCCCCGAC TGATGCGAGG 
CTGGCGGTTT TTAGCGGAAT TTTTGTTAGC TTTCTGTGCT CTTTTGCAAT AGCAAGCTTG 
TTATTAAAAC GTCAACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC 
CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA TGCAGGAATG 
GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT GAACTTGGAG 
ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC ATTAGTGGTC 
ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA TATGGTGAGT 
GTTCAAAATT TTTTAGAAAT TAAATCCTAT TACCCGCAAG TTTTAGCCAA ACTGACTGCT 
TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC GACAAAACAA 
ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAG GATCGCAAAC AATGGGAATG 
GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC TAAAGAGCCA 
CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA ACTGGCGCAA 
GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT TTTGAATAAG 
AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTTAA 
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EF097-2 (SEQ ID NO:370) 
MLTKNF LAGIVGGLLC CFGILWAPA 

VESASFWLYQ FSSWLIEANL LPLVHVFLEP LKVLFFNNAI NHGLLTPLGL EGASQTGQSI 
LFLLETNPGP GVGVLVAFLL FGPVGQRKTA GGATMIQLIG GIHEIYFPFV LMDPRLFLAV 
IAGGMSGTLV FQIFNVGLSA PASPGSLVAI LANAPTDARL AVFSGIFVSF LCSFAIASLL 
LKRQRGIEPV SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM 
PVTYQSVHQM KWQPKTLWI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY PQVLAKLTAS 
SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG VAIEVSKEPL 
ETVFFTKETT YWTRELAQA YHLDLTQQNL YWTSFLNKK EYQEWLEGGA DRCF 

EF097-3 (SEQ ID NO:371) 

ACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC 

CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA TGCAGGAATG 
GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT GAACTTGGAG 
ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC ATTAGTGGTC 
ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA TATGGTGAGT 
GTTCAAAATT TTTTAGAAAT TAAATCCTAT TACCCGCAAG TTTTAGCCAA ACTGACTGCT 
TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC GACAAAACAA 
ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAG GATCGCAAAC AATGGGAATG 
GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC TAAAGAGC C A 
CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA ACTGGCGCAA 
GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT TTTGAATAAG 
AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTT 

EF097-4 (SEQ ID NO:372) 

RGIEPV SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM 
PVTYQSVHQM KWQPKTLWI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY PQVLAKLTAS 
SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG VAIEVSKEPL 
ETVFFTKETT YWTRELAQA YHLDLTQQNL YWTSFLNKK EYQEWLEGGA DRCF 

EF098-1 (SEQ ID NO:373) 

TAAATGAAAA AGACAAAAGT AATGACATTG ATGGCAACCA CAACTTTAGG CGCACTGGCA 
CTTGTACCAA TGAGTGCATT AGCAGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGATT 
CAATTTGCAC CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT 
AAACCAATTA CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT 
CCGTTATCCA TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA 
AATATGACCT ACTATGCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA 
GGCCCAAACT TTGTTCAAGT CTCAGATAAT CGTGGGACTG AGACAGGTTG GACGCTAAAA 
GTAAAACAAA ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACAGC GGCCAAAGTA 
ACTTTAAGCA ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA 
GCTACGATTG AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA 
GAAGGTGCGG GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT 
ATTTCATTAG AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT 
ACTTGGACTT TGACAGATAC ACCTGCTAAC ACAGGAAACT AA 



EF098-2 (SEQ ID NO:374) 

MKKTKVMTLM ATTTLGALAL VPMSALAVDG 
PITPVDPTDP TGPKPGTAGP LSIDYASSLS 



GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK 
FGEQTITSKN MTYYAETQKY KDNAGADQEG 
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PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRWSAS QSAKPTTAPA 
TIELNPTGAE SWMAAGDKE GAGTYLMSWG DS VDTAKTS I SLEVPGSTTK . YAKKYTTTFT 
WTLTDTPANT GN 

EF098-3 (SEQ ID NO:375) 



AGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGATT 

CAATTTGCAC CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT 

AAACCAATTA CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT 

CCGTTATCCA TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA 

AATATGACCT ACTATGCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA 

GGCCCAAACT TTGTTCAAGT CTCAGATAAT CGTGGGACTG AGACAGGTTG GACGC TAAAA 

GTAAAACAAA ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACAGC GGCCAAAGTA 

ACTTTAAGCA ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA 

GCTACGATTG AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA 

GAAGGTGCGG GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT 

ATTTCATTAG AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT 
ACTTGGACTT TGACAGATAC ACCTGCTAAC ACAGGAAACT 



EF098-4 (SEQ ID NO:376) 



VDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK 

PITPVDPTDP TGPKPGTAGP LSIDYASSLS FGEQTITSKN MTYYAETQKY KDNAGADQEG 
PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRWSAS QSAKPTTAPA 
TIELNPTGAE SWMAAGDKE GAGTYLMSWG DS VDTAKTS I SLEVPGSTTK YAKKYTTTFT 
WTLTDTPANT GN 



EF099-1 (SEQ ID NO:377) 



TGATGTTGTA GAGGGCTGAT 
ATGAAGAAAT TAGGCAAGGT 
TTATTTTTAG GTGTATTTTC 
ACACCACAGG AAAAAGTAGC 
TTGCAGTTTG CTTCCGCTTG 
AGAATTCAAA GTGATTTATC 
TATGGAATTG GGTTAGGACA 
AAAAGTCAAA AAAAGGAATG 
GATGGTTCTG ATAGTGACTT 
GCGGTAGATA TTTTGAAGCT 
GTAAAAAGAA AGGCTAGTGC 
GGAGGTTCAG CCAATGTTGG 
ACTATTAATG GTGGTCAATG 
CTACAAATGA TGGGTACGGG 
AGTTCAATTG GTTGGACAGT 
GTCATTAATT TTGGTCAAGG 
GCAAGTGTTG AAGGTAAAAA 
ATTGTTGCTA AGTATTTTCG 
AGGAAATAG 



GAAATGTTTA TCAGTCTTCT 
TTTAATTGTT AGTTGTTTTA 
TTCTAGTGAA AGCGGAGATT 
ATTAGAAGTT TCTAACTACG 
GATTGGCAAT ATGGAACATG 
GTTTAATTCA GCGATAGCTT 
ATGGGATTCA GGACGAAGAG 
GAAATCAGTA GCTTTACAAA 
ACTTAAAAGA ATGTCTAAAT 
GTGGGAACGA GCTGGAACAA 
TAATAATTGG TATAAACGAC 
TGGAGGAAAA ATTGATGCCT 
TTATGGCTTA TCTGCTTTTT 
GCATATGTTT GCGAGTGAAA 
CATAAAGAAT CCAAATTATT 
TGGTGTGGCT ACTAGTATTT 
CAAGTTTACT ACTTATGAGC 
GACTTGGGGA TTAGATTTTC 



TTTTATTGAA AGGAGAGATC 
TTTTTATTCT TCCTTTTTTA 
CTTCCCAGTT TCAGCCCGCT 
TGACGTCACA TGGCGGAACG 
AAAGTGGATT AAATCCTGCT 
TTAATCCTTC GTTAGGCGGT 
TTAATTTATT AAATTTTGCA 
TGGATTTTGC GTGGAATAAG 
CAAAAGATGT GAATACACTT 
AAGATGATCC CGCAGAACAA 
TTTCTACAGG TTCCATGGGC 
TGGAAAAAGT GATGGGGCAA 
TTGTTGAAAA ACAAGGAGGT 
TTGGTAATGA TTATCCTTGG 
CAGATATTAA AGCAGGAGAT 
ATGGGCATAC TGGTGTAGTG 
AAAACGCTGA ACAAGGTCAA 
CACATGTGAC CAGCATAGTA 



EF099-2 (SEQ ID NO:378) 



MKCLS VFFLLKGEIM KKLGKVLIVS CFIFILPFLL FLGVFSSSES GDSSQFQPAT 
PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHESGLNPAR IQSDLSFNSA IAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA 
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VDILKLWERA GTKDDPAEQV KRKASANNWY KRLSTGSMGG GSANVGGGKI DALEKVMGQT 
INGGQCYGLS AFFVEKQGGL QMMGTGHMFA SEIGNDYPWS SIGWTVIKNP NYSDIKAGDV 
INFGQGGVAT SIYGHTGWA SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR 
K 

EF099-3 {SEQ ID NO:379) 

TAGTGAA AGCGGAGATT CTTCCCAGTT TCAGCCCGCT 

ACACCACAGG AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA TGGCGGAACG 
TTGCAGTTTG CTTCCGCTTG GATTGGCAAT ATGGAACATG AAAGTGGATT AAATCCTGCT 
AGAATTCAAA GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC GTTAGGCGGT 
TATGGAATTG GGTTAGGACA ATGGGATTCA GG AC GAAG AG TTAATTTATT AAATTTTGCA 
AAAAGTCAAA AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC GTGGAATAAG 
GATGGTTCTG ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT GAATACACTT 
GCGGTAGATA TTTTGAAGCT GTGGGAACGA GCTGG AACAA AAGATGATCC CGCAGAACAA 
GTAAAAAGAA AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG TTCCATGGGC 
GGAGGTTCAG CCAATGTTGG TGGAGGAAAA ATTGATGCCT TGGAAAAAGT GATGGGGCAA 
ACTATTAATG GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA ACAAGGAGGT 
CTACAAATGA TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA TTATCCTTGG 
AGTTCAATTG GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA AGCAGGAGAT 
GTCATTAATT TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC TGGTGTAGTG 
GCAAGTGTTG AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA ACAAGGTCAA 
ATTGTTGCTA AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC CAGCATAGTA 
AGGAAAT 

EF099-4 (SEQ ID NO:380) 
SES GDSSQFQPAT 

PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHESGLNPAR IQSDLSFNSA IAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA 
VDILKLWERA GTKDDPAEQV KRKASANNWY KRLSTGSMGG GSANVGGGKI DALEKVMGQT 
INGGQCYGLS AFFVEKQGGL QMMGTGHMFA SEIGNDYPWS SIGWTVIKNP NYSDIKAGDV 
INFGQGGVAT SIYGHTGWA SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR 
K 

EF100-1 (SEQ ID NO:381) 

TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA 
ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG 
CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG 
ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG 
CAATGA 

EF100-2 (SEQ ID NO:382) 

MKKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK 
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EF100-3 (SEQ ID NO:383) 

TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 



AACAAAAATA CGCAGGGTTT 
TATTGATTTT ACTTTTTGTC 
GCAATGAAGC AATCGTAAAA 
ATAAGACGCC TTCCTTAAAT 
ATAAATATAC AGCAGAAAAG 
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CAAT 



EF100-4 (SEQ ID NO:384) 



KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 



EF100-1 (SEQ ID NO:385) 



TANTTATGGC AATATGGAAG GAGTTTTATA 
ACATTATTAG AAATGTTGAT TGTCTTATTG 
CCTAACTTAG CGAAACATAA AGAAACAGTT 
ATTGTAGAAT CACAAATCGA GCTCTACACA 
GAATTAGTCA ACGAAGGCTA CATTACTAAA 
CAATGA 



ATGAAAAAGA AACAAAAATA CGCAGGGTTT 
ATTATTTCCG TATTGATTTT ACTTTTTGTC 
GATAAAAAAG GCAATGAAGC AATCGTAAAA 
CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAGCAGTTAG ATAAATATAC AGCAGAAAAG 



EF100-2 (SEQ ID NO:386) 



MKKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK 
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 



EF100-3 (SEQ ID NO:387) 



TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAAT 



EF100-4 (SEQ ID NO:388) 



KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 



EF101-1 (SEQ ID NO:389) 



TGAGGAGATG AAACGAAGAA AATGAAGAAG 
GTAATTGCGG TTGGGGGCAT CGTAACTGTG 
GCTGTCAAGC AAGCGCCTAA AGATGACTGG 
CAACAAATTT ATATTAACGG TGTCATCCAA 
CAAAAAATAA CAAAGGATCC AGAGATTAAG 
ACAGAATTAT TTACTTATGA AGATGAGGCG 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG 
GATAAATTTA ATAAAACTAA AGAAGAAGAC 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT 
GCGGATTTAG GAGCGAAGCA ATATATTTCC 
ATTCCAGAAG TAAAAGATGC CAATTCACCG 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG 
CTAACTTCTG TTTCCAACAA TGTGGTTGTG 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT 
CAAGCAACCA TTGATTTAGG CGATTTAGGG 
AAAGAGGGTG AACAGGCCTA CGTTTTAGTG 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA 
GACCGAGTGG TTATTTCTTC AAAAAAACCA 



AAAACGATAA TTATATTGGG GGCAGTTGCG 
AATGCGTTAA ATAAAAATGC ACAACAAGTA 
GGAATTGACT ATTTTGACGT TCCCGACTTG 
CCGGAACAAA TGGAAGCCTT TGCGCGTGAT 
GTGAAAAACG GCGATGTCGT AGATGCAGGC 
GTCACAAAAG AAATTGAGGC ACAACAAAAT 
AATATCTATA ATAAGTGGAA TCGGGCCATT 
CGCACGATGT CTGGTGATGA TTTAAATGAA 
GAAGAGATTA CCTTCACCAA TGAAACCTTA 
ACAAAGGCTA ATTTCAAAGG TCGTGTATCA 
ATTTTACGGT TAACTTCAGA AGATCTTTAT 
ACTAAAATTA GTGTTGGGCA AAAAGCTAAA 
GATGGCTCAA TTTCTTACAT CGATGATAAT 
GGCAATCCAG AGGGCGGCAC AACGATGTCT 
TTAGACAAAG TCAAAAATGG CTACCATATG 
GCGATTGAGT TAC CGAAAAA AGCGATTCAA 
AATGATTTTG GAACCATCAT TCGTCGTGAT 
ATGGCGATTG AATCTGGCTT AGAATCAGCC 
GTAAAAGTCG GTGATATTGT TGAATCAGAT 
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GCAGCGATTG CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC GTCGAAATAG 
EF101-2 (SEQ ID NO:390) 

MKKK TIIILGAVAV IAVGGIVTVN ALNKNAQQVA VKQAPKDDWG IDYFDVPDLQ 
QIYINGVIQP EQMEAFARDQ KITKDPEIKV KNGDWDAGT ELFTYEDEAV TKEIEAQQNS 
LAKLETKRAN IYNKWNRAID KFNKTKEEDR TMSGDDLNEQ YQTEVDAVDE EITFTNETLA 
DLGAKQYIST KANFKGRVSI PEVKDANSPI LRLTSEDLYL AGKVNEKDLT KISVGQKAKL 
TSVSNNWVD GSISYIDDNP PEGNSDAASG NPEGGTTMSS YSVKIALANL DKVKNGYHMQ 
ATIDLGDLGA IELPKKAIQK EGEQAYVLVN DFGTIIRRDV QVGQENGDKM AIESGLESAD 
RWISSKKPV KVGDIVESDA AIASDESATN ESMTDASK 

EF101-3 (SEQ ID NO:391) 

TAAAAATGC ACAACAAGTA 

GCTGTCAAGC AAGCGCCTAA AGATGACTGG GGAATTGACT ATTTTGACGT TCCCGACTTG 
CAACAAATTT ATATTAACGG TGTCATCCAA CCGGAACAAA TGGAAGCCTT TGCGCGTGAT 
CAAAAAATAA CAAAGGATCC AGAGATTAAG GTGAAAAACG GCGATGTCGT AGATGCAGGC 
ACAGAATTAT TTACTTATGA AGATGAGGCG GTCACAAAAG AAATTGAGGC AC AAC AAAAT 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG AATATCTATA ATAAGTGGAA TCGGGCCATT 
GATAAATTTA ATAAAACTAA AGAAGAAGAC CGCACGATGT CTGGTGATGA TTTAAATGAA 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT GAAGAGATTA CCTTCACCAA TGAAACCTTA 
GCGGATTTAG GAGCGAAGCA ATATATTTCC ACAAAGGCTA ATTTCAAAGG TCGTGTATCA 
ATTCCAGAAG TAAAAGATGC CAATTCACCG ATTTTACGGT TAACTTCAGA AGATCTTTAT 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG ACTAAAATTA GTGTTGGGCA AAAAGCTAAA 
CTAACTTCTG TTTCCAACAA TGTGGTTGTG GATGGCTCAA TTTCTTACAT CGATGATAAT 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT GGCAATCCAG AGGGCGGCAC AACGATGTCT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT TTAGACAAAG TCAAAAATGG CTACCATATG 
CAAGCAACCA TTGATTTAGG CGATTTAGGG GCGATTGAGT TACCGAAAAA AGCGATTCAA 
AAAGAGGGTG AACAGGCCTA CGTTTTAGTG AATGATTTTG GAACCATCAT TCGTCGTGAT 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA ATGGCGATTG AATCTGGCTT AGAATCAGCC 
GACCGAGTGG TTATTTCTTC AAAAAAACCA GTAAAAGTCG GTGATATTGT TGAATCAGAT 
GCAGCGATTG CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC GTCGAAAT 

EF101-4 (SEQ ID NO:392) 

KNAQQVA VKQAPKDDWG IDYFDVPDLQ 

QIYINGVIQP EQMEAFARDQ KITKDPEIKV KNGDWDAGT ELFTYEDEAV TKEIEAQQNS 
LAKLETKRAN IYNKWNRAID KFNKTKEEDR TMSGDDLNEQ YQTEVDAVDE EITFTNETLA 
DLGAKQYIST KANFKGRVSI PEVKDANSPI LRLTSEDLYL AGKVNEKDLT KISVGQKAKL 
TSVSNNWVD GSISYIDDNP PEGNSDAASG NPEGGTTMSS YSVKIALANL DKVKNGYHMQ 
ATIDLGDLGA IELPKKAIQK EGEQAYVLVN DFGTIIRRDV QVGQENGDKM AIESGLESAD 
RWISSKKPV KVGDIVESDA AIASDESATN ESMTDASK 

EF102-1 (SEQ ID NO:393) 

TAAACATTTG AGACATTCAG AGGTGAATGT CTCTTTTTTA TTACTCAAAA ACGAAAGGGG 

ATTAATTATA TGAAAAAAAC AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA 

TTAAGTCAAA CAATTGGCGG AACGATTGGT CCTACGATTG CTTTTGCCGA TGAAATTACT 

CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 

TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 

GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 

GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 

TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 

GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
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AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG 
ACTTTGGCTT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA CATTCCCGCA AACTGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC 
TTTATCTTGA TTTTTTCGAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA 

EF102-2 (SEQ ID NO:394) 

MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHS IK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
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SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YWAQAIDVE ATKAAQEKDE KAKPWIAET 
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF 
ILIFSTAGYY FWNRRN 



EF102-3 (SEQ ID NO:395) 



TT TAGATGAAAA GAACCAAGTT GCCGTTAJ 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT 
ACTGCCGAAA CTGGATTTAA CGACTTATCT 
GAAATCACAG GTGCTGAAGA TAAAGCGACC 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA 
GACGACTATG CGAAGAGTGA GTATGTCTTT 
ATTAAGATGG TGACCGTTCC TTACGAGAAA 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG 
TGGAAAGACG GAAATAAAAA ATTGAATACC 
AGATATAACT TGCATGAAAT CAAAGAAGAC 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT 
GATGGTTCGC AAACTTTTAC TCATGGTGAC 
ACCCATGATG TACTGGATGG CTCAAAAGAA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC 
AGTACGCCAA AACAACCGGA AACACCAGCT 
ACAGTGAAGA 



iAC ACCTAGCAAT TAACGAGTAT 
TATACTTTGG ATGAAACGAA GTATCCTGTA 
AATGCCGTAA TTACTCGAGA TGTTACGGCA 
TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGGGATTATT TACTTGAAGA AATAGAGGCT 
GAAATCCGTT CTACATTTAA GGAAAACAAA 
ACCATTACCG AAGAAGGACA AAAACAACCA 
CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AAAGAAGATA GTTTGACTTC TCTTGCGACT 
CTTGATTTTA CCGAGCTAGT TGATAAATTG 
TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACTGGAACTT GGAAAATTCT GCATAAATTA 
GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GTGATGGATA TGTTTGATGA TGTGTCGGTT 
GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGTAGATA CCGG AAAGTA TCCAGAAGGA 
GAAAAAGATG GAAACGTGAA TGGAAAACAC 
TTAACACCAA AAGAAGTGCC AACCATACCG 
GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 



EF102-4 (SEQ ID NO:396) 



LDEKNQVA VKHLAINEYF WQETKAPEGY T] 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW 
TATLANKEKT GTWKILHKLT AEQVLDKSIV 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK 



FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
YWAQAIDVE ATKAAQEKDE KAKPWIAET 
LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
MDMFDDVSVT HDVLDGSKEA FETILYALLP 
VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
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EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VK 
EF103-1 (SEQ ID NO:397) 

TAAGATAGGT TTATCAAAGA AAAGGAGCGA TGCTTTATGA AAAAGAAAGT ATTAAGTTCG 
ATT AC TTT AG TAACATTAAG TACGTTACTT ATAGCAGGTT ATGCAAGTCC AGCATTTGCA 
GATCATGCAG CCAATCCAAA TAGTGCTACA GCAAATTTAG GCAAACATCA AAACAATGGC 
CAAACAAGAG GCGACAAGGC GACTAAGATT TTATCTGGCA CGGACTGGCA AGGAACCCGT 
GTTTATGATG CTGCTGGTAA TGATTTAACG GCAGAAAATG CTAATTTTAT TGGTTTAGCA 
AAATATGATG GTGAAACCGG TTTTTACGAG TTTTTCGACA AAAATACTGG GGAAACCCGT 
GGTGACGAAG GAACATTTTT TGTGACAGGT GATGGCACAA AACGAATTTT AATTTCGCGG 
ACACAAAATT ATCAAGCCGT AGTGGATTTA ACCGAAGTGA GTAAAGACNA ATTTACTTAC 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT GATGTTGAAG TTTACGTGGA ACACATCCCT 
TATCATGGGA AAAAATTAGC TTTTACAAAT GGACGTGAAG CATTAACCAA TCAAACTGGC 
AAAATTGTGA CAAATAAATC AGGGGATAAA ATTTTAGGAA CAACCTTGTG GAATGGCACA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG ACAGCGGCCA ATCAAAATTT CATTAGTTTA 
GCGAAATTTG ATCCAAACAC AAGTAAATAT GAATTTTTCA ATTTACAAAC AGGTGAAACC 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG GACAATAACA AGATTCGGGC CCATGTATCT 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA GAATTAACGG AACTAAACAA TGATCGATTT 
ACGTATACTC GAATGGGTAA AGATAATGCT GGTAATGATA TTCAAGTGTT CGTGGAACAT 
GAACCTTACC AAGGCACATA TCATCCAGCC TTTACTTTCT AA 

EF103-2 (SEQ ID NO:398) 

MKKKVLSSI TLVTLSTLLI AGYASPAFAD HAANPNSATA NLGKHQNNGQ 
TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YDGETGFYEF FDKNTGETRG 
DEGTFFVTGD GTKRILISRT QNYQAWDLT EVSKDXFTYK RLGKDKLGND VEVYVEHIPY 
HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK WDKNGNDVT AANQNFISLA 
KFDPNTSKYE FFNLQTGETR GDFGYFQWD NNKIRAHVSI GTNRYGAALE LTELNNDRFT 
YTRMGKDNAG NDIQVFVEHE PYQGTYHPAF TF 

EF103-3 (SEQ ID NO:399) 

TCATGCAG CCAATCCAAA TAGTGCTACA GCAAATTTAG GCAAACATCA AAACAATGGC 
CAAACAAGAG GCGACAAGGC GACTAAGATT TTATCTGGCA CGGACTGGCA AGGAACCCGT 
GTTTATGATG CTGCTGGTAA TGATTTAACG GCAGAAAATG CTAATTTTAT TGGTTTAGCA 
AAATATGATG GTGAAACCGG TTTTTACGAG TTTTTCGACA AAAATACTGG GGAAACCCGT 
GGTGACGAAG GAACATTTTT TGTGACAGGT GATGGCACAA AACGAATTTT AATTTCGCGG 
ACACAAAATT ATCAAGCCGT AGTGGATTTA ACCGAAGTGA GTAAAGACNA ATTTACTTAC 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT GATGTTGAAG TTTACGTGGA ACACATCCCT 
TATCATGGGA AAAAATTAGC TTTTACAAAT GGACGTGAAG CATTAACCAA TCAAACTGGC 
AAAATTGTGA CAAATAAATC AGGGGATAAA ATTTTAGGAA CAACCTTGTG GAATGGCACA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG ACAGCGGCCA ATCAAAATTT CATTAGTTTA 
GCGAAATTTG ATCCAAACAC AAGTAAATAT GAATTTTTCA ATTTACAAAC AGGTGAAACC 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG GACAATAACA AGATTCGGGC CCATGTATCT 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA GAATTAACGG AACTAAACAA TGATCGATTT 
ACGTATACTC GAATGGGTAA AGATAATGCT GGTAATGATA TTCAAGTGTT CGTGGAACAT 
GAACCTTACC AAGGCACATA TCATCCAGCC T 

EF103-4 (SEQ ID NO:400) 

HAANPNSATA NLGKHQNNGQ 

TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YDGETGFYEF FDKNTGETRG 
DEGTFFVTGD GTKRILISRT QNYQAWDLT EVSKDXFTYK RLGKDKLGND VEVYVEHIPY 
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HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK WDKNGNDVT AANQNFISLA 
KFDPNTSKYE FFNLQTGETR GDFGYFQWD NNKIRAHVSI GTNRYGAALE LTELNNDRFT 
YTRMGKDNAG NDIQVFVEHE PYQGTYHPA 

EF104-1 (SEQ ID NO:401) 

TGAAAGGGGA TTAGTATGAA GAAAAAAACT TTTTCTTTTG TGATGTTGAG TATACTTCTC 
GCACAAAATT TCGGGTTTGC CGTAAATGCC TATGCTGTAA CAACGACAGA AGCACAAACA 
GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC ATCTTTACCT 
TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA ATCGCAAACC 
ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGCTACAC CATCTGAACA ACAAACAACG 
GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC GCCAATTACA 
GATGAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA CAGCCAAACA 
GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT TGCTGTTGGA 
AACACAGGTT ATGCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA TTTAGGTTAT 
TCAACTGTCA GCGGAGAAGT GACAGGCATT GGCGCAACTT GGGCAGTCGA TGCGGCGACC 
AAAACATTAA GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA AGTAGAACTA 
AAAAGTTATC TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC AGGAAAAAAT 
AAAAAAACCT ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA TAACGAACGA 
ACCAGAACGA CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC GTTAACTGGC 
AATCAAACAT TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG AAAACAAGAT 
AACTTGGAAC CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT TTTACCAGAA 
ACGCAAACCT TGTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC ACTCGGGCGG 
ATTGCTGTAA CTGTTCCAAA CATGAATCAA CAAAAAGCCT ATTCCTTATC GATTAATCGG 
ACAATTTATT TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA GTATCCAACA 
ACAAAAATTG GGTCAATTTC TTTGAAAAGT ACGACAGGAA CTAAACAAAC AACCGATTTT 
ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA ATTGCTGATC GTGAAATGCG TAGTATGTCC 
TATATCAGTT TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC GTTAACAGAA 
ACAAAAGTGG GTCAACAAAT CGTATTAGAG AGTACAAACG GTCAAGAAAT TAAGAATCCT 
AAATTTACGG CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA TTTTGATATT 
AAAACTGAAG GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT AAGAATAAAT 
ATTTCTGATT TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT AAGTACACCT 
GTAATTGGTC CTAATAAAGC CATTCAATTA GTATCCGATC AATATATTGA ACCAATTAGT 
GTTGTTAATC CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA TGGTGCCTAT 
TCATCAAGAA CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA AAATTTAGAA 
ATTAAAGTAA AGCATCCTAA TTATCTTTCA TTACGAGCTA CAAAAGAAAT TTATTTTTAT 
TACAAGTTAG GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCAGT TATTAAGTTC 
ACTACGCCAA TAACCAACGA AATCCAAATT CCAATTGGTT TTAATTATGT GCCAGATAGT 
TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG TGCTGAAGGT 
TTAACTCCAG TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA ACGAACACTT 
CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA CTCACTAAGC 
GTCCGTACAA AAATTCCAGC TGGCGCCGAT GTTCTTTTTG ACATTTATGA TGTTTCAAAC 
GATCAGGTAG ATTCAATTTA TCCACAATAC TGGGACCGCG GTCAATACTT TGATAAACCA 
ATGACGCCAA ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC CAATAGTTAC 
ACGTTTGATT TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA CGCCAATGGC 
TGGATCGACG TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA ATCGAATAAT 
AATGAAGGCT CTGCTTCGGT TTCTGTTCAA AATGAAGCGT TAGACATTTT GAGTGCAACA 
CAAGCGGCGA ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC AAAAAATATT 
GATAATAAAA CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA AGGCACAACC 
AATGCTCAAA TCGATTTGAA TTCTATTACC GTGAAAGGCG TGCCAGAAGA TGCTTATTCA 
TTAGAGAAGA CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT GACAGAAAAC 
ATTACGATTG AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC AGAAACAACA 
ATCGACTCTG AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC CACTGCGCCA 
ATCACATTGA AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC AACTGCCACA 
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TTCTACACGC ATAACGTAGA GGATGAAAAC CAAGCAATTG CGAAGGTTTC TTTTGAACTA 
ATTGATAATG TCACGCATAC AGCAACCGAA TTTACAACAG ATGAAAAAGG TCAATACTCC 
TTTGATGCCA TCATGACAGG TGATTATACT TTGCGAGTAA CGAATGTACC GCAGGAATAT 
TCCGTGGATG AAGAGTATTT GACAGGAAAA GCCATTAAGC TGGTCAAAGG AGACAACCAA 
CTAAAAATTC CATTAACGAA AACAATTGAT CACAGTCGTT TACAAGTCAA AGATTCAACG 
ATTTATGTCG GCGATTCATG GAAACCAGAA GAGAACTTTG TTTCAGCAAC AGATAAAACA 
GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA 
GGCGTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA TGTGACCGTC 
AAACCCGACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG TGATTCGTGG 
AAACCAGAAG ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA 
AAAATTGATG TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT 
AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG 
GTTAAAGATA CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA 
GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT 
GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA 
GCTAATGTGA CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT 
GTGGGCGATA AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA 
GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATGTTGATAA AATAGGCGAT 
TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT 
GACAGTCAGT TAGAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG GGAAGCAGAA 
GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT 
GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC 
ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT 
ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACAGAT 
AAAACAGGTC AAGATGTCCC ATTCGAAAAA ATCACTGTT 

EF104-2 (SEQ ID NO:402) 

MKKKTF SFVMLSILLA QNFGFAVNAY AVTTTEAQTE TTDTAKKEAE LSNSTPSLPL 
ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA LPDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT VQLPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL IKIETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNMNQQ KAYSLSINRT 
IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN VKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV SDQYIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL RATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT IPITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY IIEYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT KTTVTTKNID 
NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL RVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE NFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD TTIYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSGQVD TSKAGVYPIV YSYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG TVNVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP EXNFVSATDK 
TGQDVPFEKI TV 
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EF104-3 (SEQ ID NO:403) 
TGTAA CAACGACAGA AGCACAAACA 

GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC ATCTTTACCT 

TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA ATCGCAAACC 

ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGCTACAC CATCTGAACA ACAAACAACG 

GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC GCCAATTACA 

GATGAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA CAGCCAAACA 

GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT TGCTGTTGGA 

AACACAGGTT ATGCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA TTTAGGTTAT 

TCAACTGTCA GCGGAGAAGT GACAGGCATT GGCGCAACTT GGGCAGTCGA TGCGGCGACC 

AAAACATTAA GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA AGTAGAACTA 

AAAAGTTATC TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC AGGAAAAAAT 

AAAAAAACCT ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA TAACGAACGA 

ACCAGAACGA CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC GTTAACTGGC 

AATCAAACAT TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG AAAACAAGAT 

AACTTGGAAC CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT TTTACCAGAA 

ACGCAAACCT TGTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC ACTCGGGCGG 

ATTGC TGTAA CTGTTCCAAA CATGAATCAA CAAAAAGCCT ATTCCTTATC GATTAATCGG 

ACAATTTATT TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA GTATCCAACA 

ACAAAAATTG GGTCAATTTC TTTGAAAAGT ACGACAGGAA CTAAACAAAC AACCGATTTT 

ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA ATTGC TGATC GTGAAATGCG TAGTATGTCC 

TATATCAGTT TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC GTTAACAGAA 

ACAAAAGTGG GTCAACAAAT CGTATTAGAG AGTACAAACG GTCAAGAAAT TAAGAATCCT 

AAATTTACGG CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA TTTTGATATT 

AAAACTGAAG GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT AAGAATAAAT 

ATTTCTGATT TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT AAGTACACCT 

GTAATTGGTC CTAATAAAGC CATTCAATTA GTATCCGATC AATATATTGA ACCAATTAGT 

GTTGTTAATC CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA TGGTGCCTAT 

TCATCAAGAA CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA AAATTTAGAA 

ATTAAAGTAA AGCATCCTAA TTATCTTTCA TTACGAGCTA CAAAAGAAAT TTATTTTTAT 

TACAAGTTAG GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCAGT TATTAAGTTC 

ACTACGCCAA TAACCAACGA AATCCAAATT CCAATTGGTT TTAATTATGT GCCAGATAGT 

TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG TGCTGAAGGT 

TTAACTCCAG TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA ACGAACACTT 

CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA CTCACTAAGC 

GTCCGTACAA AAATTCCAGC TGGCGCCGAT GTTCTTTTTG ACATTTATGA TGTTTCAAAC 

GATCAGGTAG ATTCAATTTA TCCACAATAC TGGGACCGCG GTCAATACTT TGATAAACCA 

ATGACGCCAA ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC CAATAGTTAC 

ACGTTTGATT TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA CGCCAATGGC 

TGGATCGACG TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA ATCGAATAAT 

AATGAAGGCT CTGCTTCGGT TTCTGTTCAA AATGAAGCGT TAGACATTTT GAGTGCAACA 

CAAGCGGCGA ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC AAAAAATATT 

GATAATAAAA CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA AGGCACAACC 

AATGCTCAAA TCGATTTGAA TTCTATTACC GTGAAAGGCG TGCCAGAAGA TGCTTATTCA 

TTAGAGAAGA CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT GACAGAAAAC 

ATTACGATTG AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC AGAAACAACA 

ATCGACTCTG AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC CACTGCGCCA 

ATCACATTGA AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC AACTGCCACA 

TTCTACACGC ATAACGTAGA GGATGAAAAC CAAGCAATTG CGAAGGTTTC TTTTGAACTA 

ATTGATAATG TCACGCATAC AGCAACCGAA TTTACAACAG ATGAAAAAGG TCAATACTCC 

TTTGATGCCA TCATGACAGG TGATTATACT TTGCGAGTAA CGAATGTACC GCAGGAATAT 

TCCGTGGATG AAGAGTATTT GACAGGAAAA GCCATTAAGC TGGTCAAAGG AGACAACCAA 

CTAAAAATTC CATTAACGAA AACAATTGAT CACAGTCGTT TACAAGTCAA AGATTCAACG 
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ATTTATGTCG GCGATTCATG GAAACCAGAA GAGAACTTTG TTTCAGCAAC AGATAAAACA 
GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA 
GGC GTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA TGTGACCGTC 
AAACCCGACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG TGATTCGTGG 
AAACCAGAAG ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA 
AAAATTGATG TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT 
AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG 
GTTAAAGATA CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA 
GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT 
GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA 
GCTAATGTGA CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT 
GTGGGCGATA AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA 
GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATGTTGATAA AATAGGCGAT 
TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT 
GACAGTCAGT TAGAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG GGAAGCAGAA 
GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT 
GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC 
ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT 
ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACAGAT 
AAAACAGGTC AAGATGTCCC ATTC 

EF104-4 (SEQ ID NO:404) 

VTTTEAQTE TTDTAKKEAE LSNSTPSLPL 

ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA LPDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT VQLPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL IKIETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNMNQQ KAYSLSINRT 
IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN VKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV SDQYIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL RATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT IPITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY IIEYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT KTTVTTKNID 
NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL RVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE NFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD TTIYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSGQVD TSKAGVYPIV YSYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG TVNVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP EXNFVSATDK 
TGQDVPF 

EF105-1 (SEQ ID NO:405) 

TAAATGAAAA AAACAGTCGT CTACTCCTTG TTATTCGGAA CAATGTTGCT TGGCGCCACT 
GTTCCTGCTG AAGCGGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT TTTTACCCCA 
AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTCGA 
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CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC 
GATTATGCCT CAAGTTTGGA TTTTGGGAGT AATGAGATAT CGAATAAGGA TCAAACGTAT 
TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT 
AATTATGTAC AAGTAAGTGA TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA 
CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG CGCCACCGTC 
GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCGACGGACG TATTGCCGCC AACTGCTACC 
GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAAGC CCCAGAAAAG 
ACCGGCGCCG GAACGTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT 
CAACAAGGAC AGCAAGTAAA TGCCACAATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG 
AAAACCCCTA AGGATGCAGT ACAATATAAA ACAACATTGA CTTGGCTACT TTCAGATGTA 
CC AG TAAATA ATGGAGGGAA ATAA 



EF105-2 (SEQ ID NO:406) 

MKKTWYSLL FGTMLLGATV PAEAATWFD 
VDPTNPDGPN PGTPGPLSID YASSLDFGSN 
YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL 
NIQLDAAGAE TWMQAPEKT GAGTWITLWG 
TPKDAVQYKT TLTWLLSDVP VNNGGK 

EF105-3 (SEQ ID NO:407) 



SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP 
EISNKDQTYF ARAQTYRNPD GSASELATAN 
HKELTGATVA FTEPSVRSNA TDVLPPTATA 
QAEKVTEKNQ QGQQVNATIT RAISLTVPGK 



GGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT TTTTACCCCA 

AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTCGA 
CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC 
GATTATGCCT CAAGTTTGGA TTTTGGGAGT AATGAGATAT CGAATAAGGA TCAAACGTAT 
TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT 
AATTATGTAC AAGTAAGTGA TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA 
CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG CGCCACCGTC 
GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCGACGGACG TATTGCCGCC AACTGCTACC 
GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAAGC CCCAGAAAAG 
ACCGGCGCCG GAACGTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT 
CAACAAGGAC AGCAAGTAAA TGCCACAATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG 
AAAACCCCTA AGGATGCAGT AC 

EF105-4 (SEQ ID NO:408) 

ATWFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP 

VDPTNPDGPN PGTPGPLSID YASSLDFGSN EISNKDQTYF ARAQTYRNPD GSASELATAN 
YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL HKELTGATVA FTEPSVRSNA TDVLPPTATA 
NIQLDAAGAE TWMQAPEKT GAGTWITLWG QAEKVTEKNQ QGQQVNATIT RAISLTVPGK 
TPKDAV 

EF106-1 (SEQ ID NO:409) 

TAGTCGTTTA TGAAGAAAAA AATCGTTGGT ACAATTACGT TGTTGGCTTT AAGTGCGTTA 
TTAGTTGGTG GAGCAGGAGG GGCTTTGACG GCAGAAGCAT ACGTTCCTCA AAGCGTAGAC 
AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT 
GAAGGATTAT CAGAAAAAGA TTGGGCTGGA ACACGCGTTT ATG ATC G AAA TGGGAATGAC 
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC 
TATGAATTTT TTGATAAAGA GACTGGAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA 
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA 
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT 
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TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA 
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT 
ATTGGTAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGCAC AAAAGTAGTG 
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT 
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC 
TATGGCTACT TTAAAGTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC 
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA 
CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC 
CAAGGAACTT TTAATCCTGA ATTTACCTTT TAA 

EF106-2 (SEQ ID NO:410) 

MKKKIVGT ITLLALSALL VGGAGGALTA EAYVPQSVDN PNNLGDLPEY LRSVGIRQDE 

GLSEKDWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT 

AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY 

GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDEI I GKTSFDGTPQ LLWNGTKWD 

KDGNDVTSAN QNFISLAKFD QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN 

RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GTFNPEFTF 

EF106-3 (SEQ ID NO:411) 

AT ACGTTCCTCA AAGCGTAGAC 

AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT 
GAAGGATTAT CAGAAAAAGA TTGGGCTGGA ACACGCGTTT ATGATCGAAA TGGGAATGAC 
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC 
TATGAATTTT TTGATAAAGA GACTGGAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA 
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA 
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT 
TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA 
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT 
ATTGGTAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGCAC AAAAGTAGTG 
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT 
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC 
TATGGCTACT TTAAAGTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC 
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA 
CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC 
CAAGGAACTT 



EF106-4 (SEQ ID NO:412> 

YVPQSVDN PNNLGDLPEY LRSVGIRQDE 
GLSEKDWAGT RVYDRNGNDL TDENQNLLHA 
AGITDVSRLV IISETKNYQG VYPLRTLYQD 
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI 
KDGNDVTSAN QNFISLAKFD QDSSKYEFFN 
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI 

EF107-1 (SEQ ID NO:413) 



IKFDATTSFY EFFDKETGES TGDEGTFFMT 
TFTYRQMGKD KNGNDIEVFV ENKATSGPVY 
DVNRQGDEI I GKTSFDGTPQ LLWNGTKWD 
LQTGETRGDY GYFKVGNQNK FRAHVSIGTN 
QVYVEHEPYQ GT 



TAAAAAACGG CACTCAATAT GTCAAAATTT GAAATTTCAA GCTGTGTGTT CTTTGGTAAA 
ATANATANAA AAATGCTAGT TATCAGTATC GATAATAACA GGATACTGAT TAAGAAAGGA 
CTTTATAGAG ACTATAGATT GAATTTTTAC ATAGAAAGAA GGAGCAAGAT GAAGCGAGTA 
AATTGGAAAA GATGGCTAGT TGTTGGGTTA AGTTGTTCTT TGTTCATGGA TTCAGTGGTT 
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GGTGTGACTG TGTTAGCGGA AACGATTACT GGGGCGACGG AGCAAGGAGT AGCAACATCT 
CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG 
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGCCTGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT 
GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT 
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT 
ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA 
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGC TATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC 
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGGGA AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA 
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG 
AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT 
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT 
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC 
AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 
GTGTCCATCA CGTTGGATCA GCCTTTACCA GCTGGCGGTC AATTAAAAAT GAACTTATTA 
GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC 
AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT 
ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA 
ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC 
ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG 
CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA 
ACGGCCGCTG CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT 
GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC 
CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGCTAA CATCAAACTA 
GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT 
TTAGTGACTG GCCCCTAA 



EF107-2 (SEQ ID NO:414) 
MKRVN 

WKRWLWGLS CSLFMDSWG VTVLAETITG 
SEAKTVPPQE TARIASRAIG YSSVEGREIP 



ATEQGVATSQ SSDEASQTTQ TTEESQATVA 
FFFVEEDGTL FDPDRITMAV NLSTFSFYEE 
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KLQRTPLEPT TVNGGKLLS I PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS 
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTNE KMPNKLTFGP KKTFKYTVAT 
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFENPAG GAIPAPEGYT 
QDKKTIVTGE DFTFTQEGTL PERYTGSDGK TYLFKGWYKG NAKPSTLETT KTPSYAVTYD 
DNDDLHWYE EAVMKTYTLP AREALFGYVD EQGNLINPAK FKLSATMGES DGATGEMTTF 
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYQTI SISPTTAYTG 
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS 
IYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFTQGN QLWDSENYV 
YTVAKALPK I YQAGEKTYIF QGWFKGKTKP ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP 
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIKIK PTATWAAGIG APNTIFVQGT 
GQNTKAFPVT KEQWTTGAGV SITLDQPLPA GGQLKMNLLG TAVTGNPGQV LTADVEVTGN 
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQQYGLK KAADYYGNGT 
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAASFTDYN QPTETRTPLG 
KTSTVTLTAD NTATAWANQ QFTGSDVYQL DFTFANIKLE VPANQGMAGQ QYQAAVTWNL 
VTGP 

EF107-3 (SEQ ID NO:415) 
GG AGCAAGGAGT AGCAACATCT 

CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG 
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGCCTGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT 
GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT 
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT 
ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA 
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC 
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGGGA AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA 
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG 
AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT 
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT 
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC 
AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 
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GTGTCCATCA CGTTGGATCA GCCTTTACCA GCTGGCGGTC AATTAAAAAT G AAC TT ATT A 
GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC 
AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT 
ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA 
ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC 
ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG 
CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA 
ACGGCCGCTG CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT 
GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC 
CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGCTAA CATCAAACTA 
GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT 
TTAGTGACTG GCCCCT 

EF107-4 (SEQ ID NO:416) 

EQGVATSQ SSDEASQTTQ TTEESQATVA 

SEAKTVPPQE TARIASRAIG YSSVEGREIP FFFVEEDGTL FDPDRITMAV NLSTFSFYEE 
KLQRTPLEPT TVNGGKLLSI PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS 
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTNE KMPNKLTFGP KKTFKYTVAT 
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFENPAG GAIPAPEGYT 
QDKKTIVTGE DFTFTQEGTL PERYTGSDGK TYLFKGWYKG NAKPSTLETT KTPSYAVTYD 
DNDDLHWYE EAVMKTYTLP AREALFGYVD EQGNLINPAK FKLSATMGES DGATGEMTTF 
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYQTI SISPTTAYTG 
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS 
IYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFTQGN QLWDSENYV 
YTVAKALPKI YQAGEKTYIF QGWFKGKTKP ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP 
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIKIK PTATWAAGIG APNTIFVQGT 
GQNTKAFPVT KEQWTTGAGV SITLDQPLPA GGQLKMNLLG TAVTGNPGQV LTADVEVTGN 
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQQYGLK KAADYYGNGT 
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAASFTDYN QPTETRTPLG 
KTSTVTLTAD NTATAWANQ QFTGSDVYQL DFTFANIKLE VPANQGMAGQ QYQAAVTWNL 
VTGP 

EF108-1 (SEQ ID NO:417) 

TAATCGGTTT GGCGGGAATC GTACATAGAA AGAAGGGACG . ACATGAAGCA AACTAAGTGG 
CAACGATTAG CAACCATTGG CTTGTGTAGT TCTTTAGTAA TTAACGCCTT TTCTGGTGTG 
ACGGCAGTTG CGGAAACCGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG 
GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 
AC C AGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG GAAAACCAAA 
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ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT 
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG 
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA 
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC 
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC 
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC 
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA 
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC 
TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 
AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA 
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT 
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA 
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAAAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT 
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AGGTCCTTAA 

EF108-2 (SEQ ID NO:418) 

MKQTKWQ RLATIGLCSS LVINAFSGVT AVAETVTIES SPTAESSAKE 

ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI IPKPKKINMK 

ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM VQGSASTVGN 

LKNVIMPATS WMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGKID STYPLPAIAL 

AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVINSDPYT FQQSGFLPET 
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YKVGTKSYRF KGWYKGKTKT EPLATTKTPS YKVTYDDNDD LTWYEEFSG YELPASTNQF 
GFVDEATNKL IAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV PVNYFEQYRV 
NTFYGASDIT FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA 
PSYQVTYDDN DDLNWYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK YYAQSTSAYL 
RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF NVDKLAIDQQ 
LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEEIP TASVTLTRPK EVIDTNTNVI WTTTITNTSK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKKVS VAFTTRATGK 
PNTVLKAEW VFGGIKDSTV DNFVRIRPND QEWTPTTEG FISVPTFDFG QVGVAGTKQQ 
HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL LLGAAPVSSF 
TNYNQPTELK NTVGTTSAIS LTANNTATS I IANKQFTGSN VYQLDFTFNN VKLEVPANQG 
VKGQQYKAAV TWNLVTGP 

EF108-3 (SEQ ID NO:419) 

CGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG 

GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG GAAAACCAAA 
ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT 
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG 
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA 
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC 
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC 
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC 
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA 
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC 
TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 
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AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA 
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT 
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA 
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAAAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT 
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AG 

EF108-4 (SEQ ID NO:420) 

VTIES SPTAESSAKE 

ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI IPKPKKINMK 
ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM VQGSASTVGN 
LKNVIMPATS WMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGKID STYPLPAIAL 
AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVINSDPYT FQQSGFLPET 
YKVGTKSYRF KGWYKGKTKT EPLATTKTPS YKVTYDDNDD LTWYEEFSG YELPASTNQF 
GFVDEATNKL IAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV PVNYFEQYRV 
NTFYGASDIT FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA 
PSYQVTYDDN DDLNWYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK YYAQSTSAYL 
RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF NVDKLAIDQQ 
LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEE I P . TASVTLTRPK EVIDTNTNVI WTTTITNTSK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKKVS VAFTTRATGK 
PNTVLKAEW VFGGIKDSTV DNFVRIRPND QEWTPTTEG FISVPTFDFG QVGVAGTKQQ 
HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL LLGAAPVSSF 
TNYNQPTELK NTVGTTSAIS LTANNTATSI IANKQFTGSN VYQLDFTFNN VKLEVPANQG 
VKGQQYKAAV TWNLVT 



EF109-1 (SEQ ID NO:421) 
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AGGAGTAAAT TAATGAAAAA AAGTGTTATA 
GGATTTCTCG TTACCCCTAT TTCTGCTTAC 
GAAACGGTGG CTTCAGAAAC ATCTCTAACG 
GAAATGAACC CAAGCATCAT AAATTCTCAA 
ACCTCCGATT CCACCACTGA AGTTTCTACA 
NATAGTAGCG ACGTACTGAA ACTACTTTGG 
TAG 



ACTAGTTCTA TGTTAGCGGT TTTGTTGTCG 
GCTTTGGAAC GCTCTAAGGG AACTACTGAA 
GAGCGACAAA TGAGTAGCGG TGTCACTGAA 
GAGGAAACAG AAACAACGTC CACTTCCTCA 
TCAGAAGTAA CAACTGTTAA TGATACAGAA 
NAACATCACN AAGTAATGAG GACACACCTA 



EF109-2 (SEQ ID NO:422) 

MKKSVI TSSMLAVLLS GFLVTPISAY ALERSKGTTE ETVASETSLT ERQMSSGVTE 
EMNPSIINSQ EETETTSTSS TSDSTTEVST SEVTTVNDTE XSSDVLKLLW XHHXVMRTHL 

EF109-3 ( SEQ ID NO:423) 

GGAAC GCTCTAAGGG AACTACTGAA 

GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG TGTCACTGAA 
GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG AAACAACGTC CACTTCCTCA 
ACCTCCGATT CCACCACTGA AGTTTCTACA TCAG 

EF109-4 (SEQ ID NO:424) 

ERSKGTTE ETVASETSLT ERQMSSGVTE EMNPSIINSQ EETETTSTSS TSDSTTEVST S 
EF110-1 (SEQ ID NO:425) 

TAAATAAAAA TGGATAAGGA GTGGCATAAT CTTATGAAAA AGTTCTCCAT ACGAAAAATT 

AGTGCTGGTT TTTTGTTTCT GATTTTAGTA ACTTTGATCG CCGGTTTTAG CTTGTCTGCA 

AATGCAGAAG AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG 

GACCCTGAGG ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC 

GGAAGAATCA TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT 

GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA 

GTATTAAATC CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG 

ACACCATTTG GCAAATTCAA AGTGATTGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG 

GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA 

ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT 

CCAGGTGAGA AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT 

ACAGACTTAG AAAATCCATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT 

TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG 
CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGCCAG CAGCGTAA 



EF110-2 (SEQ ID NO:426) 

MKKFSIRKIS AGFLFLILVT LIAGFSLSAN 
PEDRRQEVAD TTEAPFASIG RIISPASKPG 
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV 
PFVLKKFESS DTHVTISGYP GEKNHTQWSH 
PIYNDQVEW GVHSNGGIKQ TGNHGQRLNE 

EF110-3 (SEQ ID NO:427) 



AEEYIVPAES HSRQKRSLLD 
YISLGTGFW GTNTIVTNNH VAESFKNAKV 
AFSPNADIAV VTVGKQNDRP DGPELGEILT 
ENDLFTSNFT DLENPLLFYD IDTTGGQSGS 
VNYNFIVNRV NEEENKRLSA VPAA 



AG AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG 
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GACCCTGAGG ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC 
GGAAGAATCA TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT 
GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA 
GTATTAAATC CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG 
ACACCATTTG GCAAATTCAA AGTGATTGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG 
GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA 
ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT 
CCAGGTGAGA AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT 
ACAGACTTAG AAAATCCATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT 
TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG 
CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGCCAG CAGCGT 

EF110-4 (SEQ ID NO: 428) 

EYIVPAES HSRQKRSLLD 

PEDRRQEVAD TTEAPFASIG RIISPASKPG YISLGTGFW GTNTIVTNNH VAESFKNAKV 
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV AFSPNADIAV VTVGKQNDRP DGPELGEILT 
PFVLKKFESS DTHVTISGYP GEKNHTQWSH ENDLFTSNFT DLENPLLFYD IDTTGGQSGS 
PIYNDQVEW GVHSNGGIKQ TGNHGQRLNE VNYNFIVNRV NEEENKRLSA VPAA 

EF111-1 (SEQ ID NO:429) 

TGATCAATAC ACTTCGATAC GGTCGCTTTT TTTCTAGAGA AAGTTGAATC TTTCAATAAT 
AAAAAGGGAT ACACTCCATT TGGCATAGTC CTTGCTGATA ATAAATCAGT GTATAAAGCG 
CTATCATTTT ATAGGAGGGG TTTTATGAAG GGTTTATCAA AAAAGAAACG GGTGTCTACT 
TGGTTAGCGT TAGGAATCAC CGTAGTCAGC TGTTTTGCGT TAAGCAGGGA AGTGCAAGCA 
AGTGTTGAAA GAACAAAAGT TGATGAATTT GCAAATGTTT TAGATGTGAG TGCATCACCA 
ACCGAACGGA CGAATGGCGT ATACGATACC AATTATTTTA ATAATTTTTC TGATTTAGGT 
GCATGGCATG GCTACTATTT ACCTGAAAAA AGCAATAAAG AGCTACTGGG TGGTTTTGCG 
GGGCCATTGA TTATTGCGGA AGAATATCCA GTAAACTTGG CGGCAAGTTT AAACAAATTA 
ACGGTCAAAA ATAAAAAAAC GGGAGAAACC TATGATTTAA GCCAAAGCAA CCGCATGGAC 
CTGTCTTATT ATCCTGGGCG CCTAGAGCAA ACCTATGAAT TAGACGATTT AACGATTCAT 
TTAGCTTTAA TTTTTGTCAG CAATCGAACG GCGCTTATCC AAACGACACT TGAAAACACT 
GGTGAAGAGC CCTTGTCACT TGGAGCAAGC TGGACAGGTG CGGTCTTTGA CAAAATTCAA 
GAGGGAACGG AAACCTTAGA TATTGGCACT CGTTTAACTG CTAAAGACAA TGACATTCAA 
GTGAATTTTG GTGAAGTCAG AGAAACGTGG AATTATTTTG CTACGAAAGA CACAAAATAT 
ACGATTCATC ATGCGGATAA AGTTTCAACA AAAATTGATA ATCGGAATTA TACAGCAACC 
GCTGAACCAA TTGAATTGAA GCCTAAACAA ACGTACAACA CCTATACGAC AGAAAGCTAT 
ACTTTTACAA AAGAAGAAGA GGCAAAGGAA CAACAACAAG CACCCGAATA TACCAAAAAT 
GCGGCGCGCT ATTTCAAAGA GAACAAGCAA AGATGGCAAG GATATCTAGA TAAAACGTTT 
GATCAAAAGA AAACAGCAGA ATTTCCTGAA TATCAAAATG CGCTAGTCAA ATCGATTGAA 
ACGATTAATA CCAATTGGCG AAGTGCGGCA GGTGCCTTTA AGCATGACGG GATTGTTCCG 
TCCATGTCTT ATAAATGGTT TATTGGTATG TGGGCTTGGG ATTCGTGGAA AGCGGATGTA 
GCAACGGCTG ATTTTAATCC TGAGTTAGCT AAAAATAATA TGCGGGCCTT GTTTGATTAT 
CAAATTCAAA AAGATGATAC CGTACGTCCA CAAGATGCAG GAGCGATCAT TGATGCTGTC 
TTTTACAATC AAGACAGTGC GCGTGGTGGT GAAGGTGGCA ACTGGAATGA ACGAAATTCT 
AAACCACCAT TGGCTGCATG GGCAGTTTGG CATATTTATC AAGAAACCAA AGATAAGGAA 
TTTTTAAAAG AAATGTATCC CAAACTTGTG GCTTATCATA ATTGGTGGTA TACCAACAGA 
GACCACAATA AAAATGGGAT AGCAGAATAT GGAAGCATGG TCAGTGATGC TCACTGGCAA 
AAAGACGACA AGGATCAAAT CATTAAAGAT AAAAATGGCC ACCTAAAGTG GATGATGATG 
CTGTTATTGA AGCAGCCGCG TGGGAAAGTG GCATGGATAA CGCTACACGG TTTGACAAAG 
AAGGTGTGGG CAAAGGCGAC GTTGGAGTTA AAGTTTTTGA AAACAAAAAT AAAGGAAAAG 
TAG 
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EF111-2 (SEQ ID NO:430) 



MKG LSKKKRVSTW 

LALGITWSC FALSREVQAS VERTKVDEFA 
WHGYYLPEKS NKELLGGFAG PLIIAEEYPV 
SYYPGRLEQT YELDDLTIHL ALIFVSNRTA 
GTETLDIGTR LTAKDNDIQV NFGEVRETWN 
EPIELKPKQT YNTYTTESYT FTKEEEAKEQ 
QKKTAEFPEY QNALVKSIET INTNWRSAAG 
TADFNPELAK NNMRALFDYQ IQKDDTVRPQ 
PPLAAWAVWH IYQETKDKEF LKEMYPKLVA 
DDKDQIIKDK NGHLKWMMML LLKQPRGKVA 



NVLDVSASPT ERTNGVYDTN YFNNFSDLGA 
NLAASLNKLT VKNKKTGETY DLSQSNRMDL 
LIQTTLENTG EEPLSLGASW TGAVFDKIQE 
YFATKDTKYT IHHADKVSTK IDNRNYTATA 
QQAPEYTKNA ARYFKENKQR WQGYLDKTFD 
AFKHDGIVPS MSYKWFIGMW AWDSWKADVA 
DAGAIIDAVF YNQDSARGGE GGNWNERNSK 
YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK 
WITLHGLTKK VWAKATLELK FLKTKIKEK 



EF111-3 (SEQ ID NO:431) 



TGATGAATTT GCAAATGTTT 
ACCGAACGGA CGAATGGCGT 
GCATGGCATG GCTACTATTT 
GGGCCATTGA TTATTGCGGA 
ACGGTCAAAA ATAAAAAAAC 
CTGTCTTATT ATCCTGGGCG 
TTAGCTTTAA TTTTTGTCAG 
GGTGAAGAGC CCTTGTCACT 
GAGGG AACGG AAACCTTAGA 
GTGAATTTTG GTGAAGTCAG 
ACGATTCATC ATGCGGATAA 
GCTGAACCAA TTGAATTGAA 
ACTTTTACAA AAGAAGAAGA 
GCGGCGCGCT ATTTCAAAGA 
GATCAAAAGA AAACAGCAGA 
ACGATTAATA CCAATTGGCG 
TCCATGTCTT ATAAATGGTT 
GCAACGGCTG ATTTTAATCC 
CAAATTCAAA AAGATGATAC 
TTTTACAATC AAGACAGTGC 
AAACCACCAT TGGCTGCATG 
TTTTTAAAAG AAATGTATCC 
GACCACAATA AAAATGGGAT 
AAAGACGACA AGGATCAAAT 
CTGTTATTGA AGCAGCCGCG 
AAGGTGTGGG CAAAGGCGAC 



TAGATGTGAG TGCATCACCA 
ATACGATACC AATTATTTTA 
ACCTGAAAAA AGCAATAAAG 
AGAATATCCA GTAAACTTGG 
GGGAGAAACC TATGATTTAA 
CCTAGAGCAA ACCTATGAAT 
CAATCGAACG GCGCTTATCC 
TGGAGCAAGC TGGACAGGTG 
TATTGGCACT CGTTTAACTG 
AGAAACGTGG AATTATTTTG 
AGTTTCAACA AAAATTGATA 
GCCTAAACAA ACGTACAACA 
GGCAAAGGAA CAACAACAAG 
GAACAAGCAA AGATGGCAAG 
ATTTCCTGAA TATCAAAATG 
AAGTGCGGCA GGTGCCTTTA 
TATTGGTATG TGGGCTTGGG 
TGAGTTAGCT AAAAATAATA 
CGTACGTCCA CAAGATGCAG 
GCGTGGTGGT GAAGGTGGCA 
GGCAGTTTGG CATATTTATC 
CAAACTTGTG GCTTATCATA 
AGCAGAATAT GGAAGCATGG 
CATTAAAGAT AAAAATGGCC 
TGGGAAAGTG GCATGGATAA 
GTTGGAGTTA AAGTT 



ATAATTTTTC TGATTTAGGT 
AGCTACTGGG TGGTTTTGCG 
CGGCAAGTTT AAACAAATTA 
GCCAAAGCAA CCGCATGGAC 
TAGACGATTT AACGATTCAT 
AAACGACACT TGAAAACACT 
CGGTCTTTGA CAAAATTCAA 
CTAAAGACAA TGACATTCAA 
CTACGAAAGA CACAAAATAT 
ATCGGAATTA TACAGCAACC 
CCTATACGAC AGAAAGCTAT 
CACCCGAATA TACCAAAAAT 
GATATCTAGA TAAAACGTTT 
CGCTAGTCAA ATCGATTGAA 
AGCATGACGG GATTGTTCCG 
ATTCGTGGAA AGCGGATGTA 
TGCGGGCCTT GTTTGATTAT 
GAGCGATCAT TGATGCTGTC 
ACTGGAATGA ACGAAATTCT 
AAGAAACCAA AGATAAGGAA 
ATTGGTGGTA TACCAACAGA 
TCAGTGATGC TCACTGGCAA 
ACCTAAAGTG GATGATGATG 
CGCTACACGG TTTGACAAAG 



EF111-4 (SEQ ID NO:432) 



DEFA NVLDVSASPT ERTNGVYDTN YFNNFSDLGA 

WHGYYLPEKS NKELLGGFAG PLIIAEEYPV NLAASLNKLT VKNKKTGETY DLSQSNRMDL 
SYYPGRLEQT YELDDLTIHL ALIFVSNRTA LIQTTLENTG EEPLSLGASW TGAVFDKIQE 
GTETLDIGTR LTAKDNDIQV NFGEVRETWN YFATKDTKYT IHHADKVSTK IDNRNYTATA 
EPIELKPKQT YNTYTTESYT FTKEEEAKEQ QQAPEYTKNA ARYFKENKQR WQGYLDKTFD 
QKKTAEFPEY QNALVKSIET INTNWRSAAG AFKHDGIVPS MSYKWFIGMW AWDSWKADVA 
TADFNPELAK NNMRALFDYQ IQKDDTVRPQ DAGAIIDAVF YNQDSARGGE GGNWNERNSK 
PPLAAWAVWH IYQETKDKEF LKEMYPKLVA YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK 
DDKDQIIKDK NGHLKWMMML LLKQPRGKVA WITLHGLTKK VWAKATLELK 
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EF117-1 (SEQ ID NO:433) 

TAATTCGATG GAGAAGGTGG TTTAGTGAAA AGATTTTCAT TTTTTTTACT AATTTTACTT 

GCTTTAACAG GTTGTAAATC CGGTGAAAAA GAATTTGATG AAGAATCTCT TCAAAATCTA 

AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT TCGTTTAAAT 

GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG TTTAATAAAA 

AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN TAATGAGCAA 

AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA CGGCTTTTTG 

AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAATTA G 

EF117-2 (SEQ ID NO:434) 

VKR FSFFLLILLA LTGCKSGEKE FDEESLQNLK ETXQSXSETE LQNGDVRLNE 
YISLKGEIVE SDSRSSLIKK GDRFILKSGS SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK 
GTLIESEENH DSATN 

EF117-3 (SEQ ID NO:435) 

TG AAGAATCTCT TCAAAATCTA 

AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT 
GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG 
AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN 
AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA 
AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAA 

EF117-4 (SEQ ID NO:436) 

EESLQNLK ETXQSXSETE LQNGDVRLNE YISLKGEIVE SDSRSSLIKK GDRFILKSGS 
SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK GTLIESEENH DSATN 

EF118-1 (SEQ ID NO:437) 

TGAGGGGGAA AAAGTGTGTT AAAAAGAAAA GTGGGGATTG TCGCAGGCGT TTTCTGTTCA 
GCTTTGTTAC TGACAGGTTG TGGCAAAAGT GCGAAAGATG AGTTCATTCA AGGAATCGGC 
AATCANAACG CACAAGAATC TGGGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA 
TTTTCACAAG AAGATGGTGC ACAAACGAAT CCTATGATTG GGATGCTCAT CACGCAAATC 
AAAGACGCAT CGCTTTCTGG GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA 
CTTAGAGATG AAATTAAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT 
AGATAA 

EF118-2 (SEQ ID NO:438) 

VLKRKV GIVAGVFCSA LLLTGCGKSA KDEFIQGIGN XNAQESGVXD FSMSISDMKF 
SQEDGAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR 

EF118-3 (SEQ ID NO:439) 

GAAAGATG AGTTCATTCA AGGAATCGGC 

AATCANAACG CACAAGAATC TGGGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA 
TTTTCACAAG AAGATGGTGC ACAAACGAAT CCTATGATTG GGATGCTCAT CACGCAAATC 
AAAGACGCAT CGCTTTCTGG GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA 
CTTAGAGATG AAATTAAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT 
AGAT 



TCGTTTAAAT 
TTTAATAAAA 
TAATGAGCAA 
CGGCTTTTTG 
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EF118-4 (SEQ ID NO:440) 

KDEFIQGIGN XNAQESGVXD FSMSISDMKF SQEDGAQTNP MIGMLITQIK DASLSGEDSS 
RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR 

EF119-1 (SEQ ID NO:441) 

TAAAGAATAC CGAGTAAAAT TTTCGGAAGG CTTTTTTTCA AAAATTGTAT ATGCAAAAGA 

AGTGCAACGG AAAGGAGCTC GGAAATCGTG AATAAGCTAC CTTTACTTAT TTTATTGTTA 

GGCGGAGTGT TGCTTGTTAG TGGCTGTCAA AGCCATAAGG AAGAAAACAA GTCTAGTAAA 

GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA GGGAACAATC GAAGGAATCG 

TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA AATTAGAAGA ACCAGATCAT 

GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT TTACAAGTAT TAATGATCGC 

AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA AAAAAAATGG AATTGATGTT 

AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA CGATTTATAA AAATGATCAA 

CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 
TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT ATAATTCAGT TAAGCAAGAG 
TATTAG 



EF119-2 (SEQ ID NO:442) 

VN KLPLLILLLG GVLLVSGCQS HKEENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE 
TTKLEEPDHV KLLEAYGNAY ANFTSINDRN EKLKPLMTEK CIKKNGIDVK TGVALVSVGK 
VTTIYKNDQH EYALLLDCEQ NGTQTRVLLL AKVKNNKISE MTYNSVKQEY 

EF119-3 (SEQ ID NO:443) 

AGAAAACAA GTCTAGTAAA 

GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA GGGAACAATC GAAGGAATCG 
TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA AATTAGAAGA ACCAGATCAT 
GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT TTACAAGTAT TAATGATCGC 
AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA AAAAAAATGG AATTGATGTT 
AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA CGATTTATAA AAATGATCAA 
CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 
TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT ATAATTCAGT TAAGCAAGAG 
TAT 

EF119-4 (SEQ ID NO:444) 

ENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE TTKLEEPDHV KLLEAYGNAY 
ANFTSINDRN 

EKLKPLMTEK CIKKNGIDVK TGVALVSVGK VTTIYKNDQH EYALLLDCEQ NGTQTRVLLL 
AKVKNNKISE MTYNSVKQEY 

EF120-1 (SEQ ID NO:445) 

TGAATAGGCG TGAAAAAGGG AATGTTAGCG TTTTTTGTCG TGCTAGCGGT TTTATCATTA 
ACTGCTTGTC GGGAACCAAA AGNAAAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA 
GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT 
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT 
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ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA 
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT 

EF120-2 (SEQ ID NO:446) 

VKKGMLAF FWLAVLSLT ACREPKXKKV TASTEASSKV EETNEKTSET IDKTNEQASS 
SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 

EF120-3 (SEQ ID NO:447) 

AAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA 

GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT 
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT 
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA 
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT 

EF120-4 (SEQ ID NO:448) 

KKV TASTEASSKV EETNEKTSET IDKTNEQASS 

SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 



EF121-1 (SEQ ID NO:449) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG TTGAGCTTTA AAAAAGTGAA GTGGGGCATG 
CATTTTTTAA TGGCTGTTGC GTTGATAGCG CCAAGTGTTA CTAGTACGGC ATATGCAGTA 
GAAACAACGA GTCAACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA 
CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATGATAA AGAACTACCA 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC 
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
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GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA 
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT 
TAG 

EF121-2 <SEQ ID NO:450) 

VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQSSEAV TSTTDSSRKQ 
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TS I AEKEETS TPQKITILGT 
SDVHGQLWNW SYEDDKELPV GLSQVSTWN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADI I VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEWKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
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PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 

PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 

ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 

NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 

PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 

SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 

IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 

FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 

GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 

IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 

RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 

KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL FVGLSSSGWY IYKRRNKAS 

EF121-3 (SEQ ID NO:451) 

ACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA 

CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATGATAA AGAACTACCA 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
. ATT AAAGG AA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC 
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTC TG A TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCG 

EF121-4 (SEQ ID NO:452) 



QSSEAV TSTTDSSRKQ 

EPVITQETTD IKQEAPNQAT SDSVKQSQET 
SDVHGQLWNW SYEDDKELPV GLSQVSTWN 



TAPTETTNLE TSIAEKEETS TPQKITILGT 
QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 
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KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEWKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN A 



EF122-1 (SEQ ID NO:453) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG 
CATTTTTTAA TGGCTGTTGC GTTGATAGCG 
GAAACAACGA GTCAACAAAG TTCAGAAGCA 
CAAGAACCAG TCATTACACA GGAAACAACA 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG 
ACGTCAGATG TTCATGGTCA ATTATGGAAT 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT 
ACCGTTTTAA TTGATAATGG CGACAATATT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC 
AAGTATGATG CAATGGTTTT GGGAAATCAT 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC 
GGTCTTCGTT TTGTTGAAGG GACTACCACG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT 
GAGTTGAAAG CTAACGATCA GGCTGACATT 
AATAGTGATC CGGCTGCCAG TGCCGACCAA 
TATATTCTGG GTCATGACCA CCTTTCTTTT 
ACTGTACCGG TAGGGGGACC GAAAGATACG 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA 
GTTCAAAAAG AAGTAACGGG CGCACAATTA 
AAATTACCTG CGGGGAAGAT TTCCTATGCC 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA 
GCGTACTATA ACCAAACACA GCCAGATGAT 
GTATATAACT ATGACATGAT TTCTGGAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC 
ACGATTGCTA TGAATAATTA TCGTTACGGC 
GAACCTATTA AAAATTCTGA TCCAGAAACC 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA 
ATTCCGACTT CTCCTGATGG ACGTACACCA 
CGTAATGCGG GCTTTGATTT AGATAATGCA 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA 
CAAGAAAACC CAACCTTGAT GGTGGATGCA 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT 
CTGAATTTTC CGATTTTATC TAGTAATACG 
GATCCGTATA CAATCGTAGA AAAATCCGGG 



TTGAGCTTTA AAAAAGTGAA GTGGGGCATG 
CCAAGTGTTA CTAGTACGGC ATATGCAGTA 
GTAACAAGTA CCACCGATTC AAGTAGAAAA 
GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACCACAGCAC CAACAGAGAC GACGAATTTA 
AGCACGCCGC AAAAAATAAC AATTTTAGGT 
TGGTCTTATG AAGATGATAA AGAACTACCA 
AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
CAAGGCACTA TTTTAACAGA TGACTTGTAT 
CATCCAATGA TCACCGCCAT GAATGTGATG 
GAGTTTAATT TTGGTTTACC GTTAATCAAA 
TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
AAGGAACTTG ATTTTAATCA AGATGGGCAG 
ACAATTCCGC ACATTCCTTT GTGGGATGGC 
TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
ATTGTTGCCT CGATTCATGC GGGACAACAA 
GTAATTGAAA ATGTCGCGGG GATTGATGCG 
ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
CAAGAAGGTA CAGCAACGAT TGTACCAACA 
GCAGCGACAA AAGAATACCA TGAAAAAACG 
GCAACAGCTG ATTTTTTACC AAAACAAGAA 
CCAACAGCGA TGATTTCTTT AATTAATAAC 
AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
ACGATTTTTG ATATCTACAA ATACCCGAAT 
AACTTACTGA AGTATTTAGA AAAACAAGGG 
TTGACCATTA GTTTTAATCC AAACATTCGT 
GACTACAAGA TTGACATTTC AAAACCAGTG 
GGCCAACCGC TGGATCCTGC CAAAGAATAT 
GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
TTACGAGGAA TGATTGTTGA TTATATTAAG 
ATCGAACGAA ATTGGTCAAT TATTGGGACA 
ATCGAATTAG TGAATGACGG CACTCTTCAA 
AACGCCGCCG CTATTACGAA ACAAGATGTC 
TATACCATTA TGCACACAAA TGACGTTCAT 
GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
GGGGATGTTT TCCAAGGATT ACCAATCTCC 
GCAATGAATG AAGTTGGTTA TGATGCCATG 
TTAGAGATTG CACTAGGTTA TAAAGACCAA 
TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
AAAAAGTTTG CCATTGTAGG TGTGACGACC 
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CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AAC TAAG AAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA 
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT 
TAG 



EF122-2 (SEQ ID NO:454) 

VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQS SEAV TSTTDSSRKQ 
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT 
SDVHGQLWNW SYEDDKELPV GLSQVSTWN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEWKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL FVGLSSSGWY IYKRRNKAS 



EF122-3 (SEQ ID NO:455) 
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TG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAG AC C AA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GC ACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GC CAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GC TTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC C AG ATCC AAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TT 

EF122-4 {SEQ ID NO:456) 

EKWRAKAI ELVNDGTLQI 

PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGF 

EF123-1 (SEQ ID NO:457) 

TAAAATAAAA AATTGGTACG AAGTGAACGT TCTCTTCTAT GTCTCGTTAG TAGAGGAAGG 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG GTAAACCGTT GGCTCTACGG GTTGATGTGT 
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TTGTTACTTG TTCTAAATTA TGGCACACCA CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATC CG ATTTATTTAA ATCCTGATGG AACACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
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CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 

EF123-2 (SEQ ID NO:458) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VWHYSANVS IKAAHWAAPN NTRKIQVDDQ 
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 
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TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA 
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
PDKSVILEEG KDYTLEVTTD NETGQQKIW KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
QVSITGNGSE WHGDDNGDV WDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WLKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 

EF123-3 (SEQ ID NO:459) 

GGAAGA GGTTAACAGC 

GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 



WO 98/50554 



PCT/US98/08959 



226 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT. 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGG 

EF123-4 (SEQ ID NO:460) 

EEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VWHYSANVS IKAAHWAAPN NTRKIQVDDQ 

KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 

SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 

TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 

PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 

GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 

VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 

YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 

EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 

DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA 

SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 

TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID W 



EF124-1 (SEQ ID NO:461) 

TAAAATAAAA AATTGGTACG AAGTGAACGT 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG 
TTGTTACTTG TTC TAAATTA TGGCACACCA 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG 
CTTCAAGGAA AAGCACAACC AGTAACACAA 
TCAATCAAAG CTGCACATTG GGCAGCGCCC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT 
ACGTTGAACC CTACAGCTAC AGAAGATGTG 
TTGACGTTAA AGACTGGTAC TGATCCGACA 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA 
ACAACGATTA TCGATAATTT TGAAGATCCG 
ACACCGCCGT ATAAAGAAGA TGTGACCATT 
GATGTGCGAG AACAAATGAA AGCAGGCGAT 



TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
GTAAACCGTT GGCTCTACGG GTTGATGTGT 
CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
GAGGTTGTAG TGCATTATAG TGCCAATGTG 
AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTTTCTT ATGGACAACA GCAACGAGCG 
GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
GCGATAGAAA GTAAAACAAC TGAATCGACG 
GATATCAGTG ATTATTTTAC AGGTGATGAA 
ATTTATTTAA ATCCTGATGG AACACCAGCA 
CATTGGAACT TTAACTGGTC GATTCCAGAA 
TACTTCGAGT TTCAATTACC TGGCAATTTG 
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AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGC TTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
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CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GC AG AGC ACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 

EF124-2 (SEQ ID NO:462) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE WVHYSANVS IKAAHWAAPN NTRKIQVDDQ 
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA 
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
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PDKSVILEEG KDYTLEVTTD NETGQQKIW KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
QVSITGNGSE WHGDDNGDV WDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WLKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 

EF124-3 (SEQ ID NO:463) 

TGC C TTC C AC ATAAC TT ATACT AC C TTTTTG ACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGG AAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG G 



EF124-4 (SEQ ID NO:464) 
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AF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW 
PDKSVILEEG KDYTLEVTTD NETGQQKIW 
QVSITGNGSE WHGDDNGDV WDIDHSGGH 
QAKTQVLREG TVDATGVITF GGLPQGQYIL 
GAQPTI IKND VNKVFLEKMD EKGKKLVNAR 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN 
SAFTIAASDR GKPATVIATA NFVNYQGTAK 
TTNNQG 



PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
ATGTKGKIQL KKTAMDETTI LAG AH FQ I WD 
VETKAPEGYT VSDELAKGRV ITIDEETSAE 
FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
LIKKDVNGHL LSGATFKVLD AKGETIQTGL 



EF125-1 (SEQ ID NO:465) 

TAAAATAAAA AATTGGTACG AAGTGAACGT 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG 
TTGTTACTTG TTCTAAATTA TGGCACACCA 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG 
CTTCAAGGAA AAGCACAACC AGTAACACAA 
TCAATCAAAG CTGCACATTG GGCAGCGCCC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT 
ACGTTGAACC CTACAGCTAC AGAAGATGTG 
TTGACGTTAA AGACTGGTAC TGATCCGACA 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA 
ACAACGATTA TCGATAATTT TGAAGATCCG 
ACACCGCCGT ATAAAGAAGA TGTGACCATT 
GATGTGCGAG AACAAATGAA AGCAGGCGAT 
AAACCTAATA AACCAGGTTC AGGTGATTTA 
TACACAATTA GTGAAGATGG TACGGTTCGT 
AGTGACATTC ACGGGGACTT TTCTTTAGAT 
CCAGGAGATT GGGTGATTGA TATTCCTACA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC 
ACTGTGACGG AAACATGGCC AACAGGGAAT 
GTGATGAATC TTGATGGAAC AATTAAAGAA 
ACCGTTGATA AAAATGGCAA TGTGACGATT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT 
GTTACCGCCA CATATGGCAA AATGTTAGAC 
CAAGAATTCA CTTGGGAAAT TAACTACAAC 
GCAGTCATTA CAGACACAAT GGGGGATAAT 
TATTCAGTGA CATTTGATGA CAAAGGAAAT 
AAAGATTACA AAGTGGTAAT CAACGGAGAC 
GTGACTGGCG CAGTCAAGAT TGATTATAAA 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC 
GCCAGTCAAC AAAATATTAT TAAAAACACT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG 



TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
GTAAACCGTT GGCTCTACGG GTTGATGTGT 
CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
GAGGTTGTAG TGCATTATAG TGCCAATGTG 
AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTTTCTT ATGGACAACA GCAACGAGCG 
GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
GCGATAGAAA GTAAAACAAC TGAATCGACG 
GATATCAGTG ATTATTTTAC AGGTGATGAA 
ATTTATTTAA ATCCTGATGG AACACCAGCA 
CATTGGAACT TTAACTGGTC GATTCCAGAA 
TACTTCGAGT TTCAATTACC TGGCAATTTG 
GTTGATGCAG AAGGCAATGT CTATGGAACC 
TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
AAACAAGGCC ATTTTGATCG AACGCCCAAT 
AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGGGTCGCG AACTTAGTCC AGATGAATAT 
AAAGGTGACA CCAACAAAGC GTATCGTCTT 
ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AATCCAAATG GGTTAGATGC TGAAGCAACT 
AAGCGCAATA TAGATTACGA CGAAGCCAAT 
TATGGTGAAC AAACCATTCC AAAAGACCAA 
TTAACGTTTG AACCAGATTC TTTACATTTA 
GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
GGTTCCTTTG CAATTGACTT TTTACATGAT 
ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GGTGCAGTTG ATTATCAAAA TTCAACGATT 
TATTTGATGG AAAATGCCGT GATTACGGAT 
GTACCCAATT CGTTGGTTGT CAAAGATACA 
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ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GG AC TAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
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ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 



EF125-2 (SEQ ID NO:466) 



MRKNGPMV NRWLYGLMCL L] 
GQLTLGEVKQ TSQQEMTLAL 
KKQIQIELNQ QALADTLVLT 
SANEGSTEEA STNSSVPRSS 
TIIDNFEDPI YLNPDGTPAT 
PNKPGSGDLV DAEGNVYGTY 
GDWVIDIPTQ EDLPPWIPI 
VTETWPTGNT FKSVKVYELV 
YQTTIDEAVI PDGGGDVPFK 
EFTWEINYNY GEQTIPKDQA 
DYKWINGDG SFAIDFLHDV 
SQQNIIKNTG AVDYQNSTIG 
TGAQLTLGKD FMVEITRNAD 
DHYRNTAAID WTDEAGNNHH 
NRLVDAFLTD PILTNQTYLA 
WRVDFPNDSR TYVIEFKTSV 
KGGEYHKDDP DHVYWHVMIN 
PDKSVILEEG KDYTLEVTTD 
QVSITGNGSE WHGDDNGDV 
QAKTQVLREG TVDATGVITF 
GAQPTIIKND VNKVFLEKMD 
EVDSLKPGLY QFTEIEAPTG 
QAGNPLAGAE FSVLDTTGQA 
SAFTIAASDR GKPATVIATA 
TTNNQGEIVA EHLAPGKYRF 
KGAFQIVKTN SADQPLAGAV 
PKLPDGADYI IYPELVKVEI 
KLYRIENGEK IFEREVTAEK 
NDKQPLDELE FVNYQAEVMG 
VSEITTDKTG EIYAKGLNEG 
NYQGTAQLTK ENETGEALAG 
ETQAPTSYLL NETPSASFTI 
FKVTDAETGQ TVARSLRSDN 
KDKPELVNAG TFVNEKQPVS 
VGLASWLFYK KSKK 



jVLNYGTPL MALAEEVNSD 
QGKAQPVTQE VWHYSANVS 
LNPTATEDVT FSYGQQQRAL 
EETVASTTKA IESKTTESTT 
PPYKEDVTIH WNFNWSIPED 
TISEDGTVRF TFNERITSES 
VPDTEQQIDK QGHFDRTPNP 
MNLDGTIKEV GRELSPDEYT 
NHATLTSDNN PNGLDAEATV 
VITDTMGDNL TFEPDSLHLY 
TGAVKIDYKT KVDGIVEGDV 
WTLAVNQNNY LMENAVITDT 
GETGFKVSFI GAYAKTSDAF 
SEDSKPFKPL PAFDLNAQKS 
GSLKVYEGNT KPDGSVEKVK 
DEKVIEGSAS YDNTASYTNQ 
GAQSVLDDW ITDTPSPNQV 
NETGQQKIW KMAHIEAPYY 
WDIDHSGGH ATGTKGKIQL 
GGLPQGQYIL VETKAPEGYT 
EKGKKLVNAR FKLEHAVTTP 
YLLDTTPKRF IVTQNTSGQI 
VREHLVSDAN GKVTVTDLAP 
NFVNYQGTAK LIKKDVNGHL 
VETKAPTGYL LNTTPVPFEI 
FELYDHNKQS LGITATSGKD 
RGDFKGDPEI FQLGAFANFK 
DGSLAMEDLG AGSYELDELD 
RKVNEQGQTL AGAVFAIYNA 
HYVLVETKAP TGYLLDTTLH 
AVFKVIDETG QTVDGQTNLM 
AKDNQGKPAT WLKAPFINY 
QGLVQVNHLQ PGKYTFVETK 
KKTKPNQPTT KQAARETGWL 



IKAAHWAAPN NTRKIQVDDQ 
TLKTGTDPTE STAITSSPAA 
VKPRVAGPTD ISDYFTGDET 
VREQMKAGDY FEFQLPGNLK 
DIHGDFSLDT HLNDSDGRGP 
SAITWTVDIN QAMKDQTNPT 
VDKNGNVTIK GDTNKAYRLE 
TATYGKMLDK RNIDYDEANQ 
SVTFDDKGNE WGAELVEGK 
AVNNRVDVGT GQHSEDDGTA 
YEPVPGLTMV PNSLWKDTT 
HITYTTFFDV TELDANNPAL 
GVYNAVTKEI TWTIAVNLSN 
PTQPLTDITM EEPSEKNQNT 
GSSRDVTGKV SIQHGGESVK 
LDPESLVIYG TNVTEDGTIT 
MEYRSLVTSS AAGSTDTVSN 
KKTAMDETTI LAGAHFQIWD 
VSDELAKGRV ITIDEETSAE 
FTHWEEVPLA PDRTNANGQL 
RDVHVKMLNY QGSAELIKKD 
GKYQFVETKA PAGYLLNTEP 
LSGATFKVLD AKGETIQTGL 
AEKNAGKPAV WASDNFVSY 
GKIIFRDLAP GTYYYKEIKA 
GRAVFKKIDA NANPLPGTIF 
ATDGYIVNKQ PIYFWKKNS 
DEQNQPQGSP ITFLNRAGEK 
PFDVTAQLGK EQPIALGDLI 
SDKQGKVIAK NLAPGTYRFV 
QGAAKLVKID QQKNALAGAE 
APDGYQLSKQ AVAFTIAATA 
GLPKTNTQVN YFFVFIGLML 



EF125-3 (SEQ ID NO:467) 
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TAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGT 

EF125-4 (SEQ ID NO:468) 

NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 

TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WLKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWLG 

EF126-1 {SEQ ID NO:469) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
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TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF126-2 (SEQ ID NO:470) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ T I SATS TEG Y VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
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IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF126-3 (SEQ ID NO:471) 
TGAA 

GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGAT 

EF126-4 (SEQ ID NO:472) 

EE AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAID 

EF127-1 (SEQ ID NO:473) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
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AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGG AAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACC CAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF127-2 (SEQ ID NO: 474) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
I PKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
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DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 

ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 

EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 

ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 

TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 

PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF127-3 (SEQ ID NO:475) 

GAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 

ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAAT 



EF127-4 (SEQ ID NO: 476) 
NQG TIAKEFPEAT 

I PKNDNAH AC DVTPEDPTIT KDIENQEHLD 
DINKVLDIID VKVTDENGKD VTANGTVTQE 
IKTDATDEEL APYIEQGGIP NQADLNFGNE 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV 

EF128-1 (SEQ ID NO:477) 



LTNREDSFDW HVKTAFGNET STWTQASMVD 
NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DDIN 



TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT- CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
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TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF128-2 (SEQ ID NO:478) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLWVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGG I PNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF128-3 (SEQ ID NO:479) 
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AGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 

CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCAT 

EF128-4 (SEQ ID NO:480) 

DENGK DVTANGKVTQ 

ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGG I PNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 



EF129-1 (SEQ ID NO:481) 

TGACAAGTGA AGAAACGTCT ATTTGCATCA GTATTACTAT GTTCATTAAC GCTATCAGCA 
ATTGCTACCC CAAGCATCGC TTTGGCGGAC AATGTTGATA AAAAAATTGA AGAAAAAAAT 
CAAGAAATTT CATCATTAAA AGCAAAACAA GGGGATTTAG CTTCACAAGT ATCTTCTTTA 
GAAGCAGAAG TATCTTCAGT ATTTGATGAA AGCATGGCTT TACGTGAACA AAAGCAAACA 
CTAAAAGCAA AATCAGAACA ATTACAACAA GAAATTACAA ACTTGAATCA ACGTATTGAA 
AAACGTAACG AAGCAATCAA AAATCAAGCA CGTGATGTTC AAGTTAATGG ACAAAGCACA 
ACAATGCTAG ATGCAGTTTT AGATGCGGAC TCAGTTGCAG ATGCAATCAG CCGTGTTCAA 
GCTGTTTCAA CAATCGTAAG TGCCAACAAC GACTTAATGC AACAACAAAA AGAAGACAAA 
CAAGCCGTTG TTGATAAAAA AGCTGAAAAC GAGAAAAAAG TGAAACAACT TGAAGCAACA 
GAAGCTGAAT TAGAAACAAA ACGTCAAGAT TTACTTTCTA AACAATCTGA ATTAAACGTA 
ATGAAAGCTT CATTAGCATT AGAACAATCA TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA 
AAACAAAAAG CAGCTGCTGA AGCAGAGCAA GCACGCTTAG CTGCTGAACA AAAAGCTGCA 
GCTGAAAAAG CCAAACAAGC TGCTGCAAAA CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA 
CCAGTTGCCT CTTCATCAAC AACAGAAGCA CAAGCACCAG CAAGCTCAAG CTCAGCAACT 
GAATCAAGCA CGCAACAAAC AACTGAAACA ACTACACCAA GTACAGATAA TAGTGCAACA 
GAAAATACTG GCTCTTCTTC ATCAGAACAA CCAGTACAAC CTACAACACC AAGCGATAAT 
GGAAATAATG GTGGCCAAAC TGGTGGTGGA ACAGTTACAC CAACACCAGA ACCAACACCA 
GCGCCTTCTG CTGATCCAAC AATCAATGCA TTGAACGTTC TACGTCAATC ATTAGGTTTA 
CGTCCAGTAG TATGGGATGC AGGTTTGGCA GCTTCTGCAA CTGCTCGTGC AGCACAAGTT 
GAAGCAGGTG GCATTCCAAA TGATCACTGG TCTCGTGGAG ATGAAGTTAT CGCAATTATG 
TGGGCGCCAG GTAACTCAGT AATCATGGCG TGGTACAATG AAACAAACAT GGTAACAGCT 
TCAGGAAGCG GTCACCGTGA TTGGGAAATT AACCCAGGTA TTACGCGTGT CGGTTTTGGT 
TACTCAGGTA GCACAATCGT AGGACACTCA GCCTAA 



EF129-2 (SEQ ID NO:482) 
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VKKRLFASV LLCSLTLSAI ATPSIALADN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE 
AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT 
MLDAVLDADS VADAISRVQA VSTIVSANND liMQQQKEDKQ AWDKKAENE KKVKQLEATE 
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA 
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE 
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR 
PWWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS 
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA 

EF129-3 {SEQ ID NO:483) 

GGAC AATGTTGATA AAAAAATTGA AGAAAAAAAT 

CAAGAAATTT CATCATTAAA AGCAAAACAA GGGGATTTAG CTTCACAAGT ATCTTCTTTA 
GAAGCAGAAG TATCTTCAGT ATTTGATGAA AGCATGGCTT TACGTGAACA AAAGCAAACA 
CTAAAAGCAA AATCAGAACA ATTACAACAA GAAATTACAA ACTTGAATCA ACGTATTGAA 
AAACGTAACG AAGCAATCAA AAATCAAGCA CGTGATGTTC AAGTTAATGG ACAAAGCACA 
ACAATGCTAG ATGCAGTTTT AGATGCGGAC TCAGTTGCAG ATGCAATCAG CCGTGTTCAA 
GCTGTTTCAA CAATCGTAAG TGCCAACAAC GACTTAATGC AACAACAAAA AGAAGACAAA 
CAAGCCGTTG TTGATAAAAA AGCTGAAAAC GAGAAAAAAG TGAAACAACT TGAAGCAACA 
GAAGCTGAAT TAGAAACAAA ACGTCAAGAT TTACTTTCTA AACAATCTGA ATTAAACGTA 
ATGAAAGCTT CATTAGCATT AGAACAATCA TCAGCTGAAA GTTCTAAAGC TGGC TTAGAA 
AAACAAAAAG CAGCTGCTGA AGCAGAGCAA GCACGCTTAG CTGCTGAACA AAAAGCTGCA 
GCTGAAAAAG CCAAACAAGC TGCTGCAAAA CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA 
CCAGTTGCCT CTTCATCAAC AACAGAAGCA CAAGCACCAG CAAGCTCAAG CTCAGCAACT 
GAATCAAGCA CGCAACAAAC AACTGAAACA ACTACACCAA GTACAGATAA TAGTGCAACA 
GAAAATACTG GCTCTTCTTC ATCAGAACAA CCAGTACAAC CTACAACACC AAGCGATAAT 
GGAAATAATG GTGGCCAAAC TGGTGGTGGA ACAGTTACAC CAACACCAGA ACCAACACCA 
GCGCCTTCTG CTGATCCAAC AATCAATGCA TTGAACGTTC TACGTCAATC ATTAGGTTTA 
CGTCCAGTAG TATGGGATGC AGGTTTGGCA GCTTCTGCAA CTGCTCGTGC AGCACAAGTT 
GAAGCAGGTG GCATTCCAAA TGATCACTGG TCTCGTGGAG ATGAAGTTAT CGCAATTATG 
TGGGCGCCAG GTAACTCAGT AATCATGGCG TGGTACAATG AAACAAACAT GGTAACAGCT 
TCAGGAAGCG GTCACCGTGA TTGGGAAATT AACCCAGGTA TTACGCGTGT CGGTTTTGGT 
TACTCAGGTA GCACAATCGT AGGACACTCA GCC 

EF129-4 (SEQ ID NO;484) 

DN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE 

AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT 
MLDAVLDADS VADAISRVQA VSTIVSANND LMQQQKEDKQ AWDKKAENE KKVKQLEATE 
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA 
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE 
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR 
PWWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS 
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA 

EF130-1 (SEQ ID NO:485) 

TGATACATTA AAAGGAGGGA AAATATGCGC CCAAAAGAGA AAAAAAGAGG AAAAAATTGG 
TTAATCAACA GTTTATTAGT TTTACTATTT ATCATTGGCT TAGCCTTAAT TTTTAACAAT 
CAGATACGTA GTTGGGTGGT TCAACAAAAT AGCCGCTCGT ACGCCGTTAG CAAGTTGAAA 
CCAGCTGATG TGAAGAAAAA TATGGCTCGT GAAACAACGT TTGACTTTGA TTCAGTTGAG 
TCCTTGAGCA CAGAAGCGGT GATGAAAGCC CAATTTGAAA ACAAAAACTT ACCTGTGATT 
GGTGCCATTG CGATACCAAG TGTCGAAATT AATTTGCCCA TTTTTAAAGG ATTGTCCAAT 
GTCGCTTTAT TAACTGGTGC CGGGACCATG AAAGAAGATC AAGTCATGGG GAAAAACAAT 
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TATGCCTTGG CTAGTCATCG AACGGAAGAT 
ACCAAAAAAG ACGAACTCAT TTATATCACT 
ACTTCTGTAG AAAAAATCGA ACCAACCCGT 
AATATGATTA CCTTAATTAC CTGTGGCGAT 
GGAACATTAG CAGCAACGAC GCCTATTAAA 
CAATTGGAGC AAAAAACTTT AGCCGATTGG 



GGCGTTTCCT TATTTTCACC TTTAGAAAGA 
GATTTATCTA CTGTTTATAC ATACAAAATA 
GTTGAGTTAA TTGATGACGT TCCTGGTCAA 
TTACAAGCAA CGACGCGAAT TGCTGTTCAA 
GACGCCAACG ACGATATGTT GAAGGCTTTC 
GTGGCTTAA 



EF130-2 (SEQ ID NO:486) 

YIKRRENMRP KEKKRGKNWL INSLLVLLFI 
ADVKKNMARE TTFDFDSVES LSTEAVMKAQ 
ALLTGAGTMK EDQVMGKNNY ALASHRTEDG 
SVEKIEPTRV ELIDDVPGQN MITLITCGDL 
LEQKTLADWV A 

EF130-3 (SEQ ID NO:487) 



IGLALIFNNQ IRSWWQQNS RSYAVSKLKP 
FENKNLPVIG AIAIPSVEIN LPIFKGLSNV 
VSLFSPLERT KKDELIYITD LSTVYTYKIT 
QATTRIAVQG TLAATTPIKD ANDDMLKAFQ 



CGTTAG CAAGTTGAAA 
CCAGCTGATG TGAAGAAAAA 
TCCTTGAGCA CAGAAGCGGT 
GGTGCCATTG CGATACCAAG 
GTCGCTTTAT TAACTGGTGC 
TATGCCTTGG CTAGTCATCG 
ACCAAAAAAG ACGAACTCAT 
ACTTCTGTAG AAAAAATCGA 
AATATGATTA CCTTAATTAC 
GGAACATTAG CAGCAACGAC 
CAATTGGAGC AAAAAACTTT 



TATGGCTCGT GAAACAACGT 
GATGAAAGCC CAATTTGAAA 
TGTCGAAATT AATTTGCCCA 
CGGGACCATG AAAGAAGATC 
AACGGAAGAT GGCGTTTCCT 
TTATATCACT GATTTATCTA 
ACCAACCCGT GTTGAGTTAA 
CTGTGGCGAT TTACAAGCAA 
GCCTATTAAA GACGCCAACG 
AGCCGATTGG GTGGCT 



TTGACTTTGA TTCAGTTGAG 
ACAAAAACTT ACCTGTGATT 
TTTTTAAAGG ATTGTCCAAT 
AAGTCATGGG GAAAAACAAT 
TATTTTCACC TTTAGAAAGA 
CTGTTTATAC ATACAAAATA 
TTGATGACGT TCCTGGTCAA 
CGACGCGAAT TGCTGTTCAA 
ACGATATGTT GAAGGCTTTC 



EF130-4 (SEQ ID NO:488) 



VSKLKP 

ADVKKNMARE TTFDFDSVES 
ALLTGAGTMK EDQVMGKNNY 
SVEKIEPTRV ELIDDVPGQN 
LEQKTLADWV A 



LSTEAVMKAQ FENKNLPVIG 
ALASHRTEDG VSLFSPLERT 
MITLITCGDL QATTRIAVQG 



AIAIPSVEIN LPIFKGLSNV 
KKDELIYITD LSTVYTYKIT 
TLAATTPIKD ANDDMLKAFQ 



EF131-1 (SEQ ID NO:489) 

TAGGCGGAGG TAAGCGGTAT GCGTAAACGA 
TGGCTTTTTA TAGTATGTTT GTTGGTGGTG 
TTCTTTTTCA CTAGAGATTC ACAAGTTAGT 
CGCCGAAGTG ATAATTATGC GAATTTAACG 
CTTGATCAAA AAATTCAAGA AACAAATTAT 
CAGGTTTTAG TAAATAAAGG ATATGGCTTT 
CCAAACACAA GGTTTCAGAT TGGCTCAATT 
AAAGCAATTG AAGAAGGTAA ACTTACATTA 
ATTCAAGGTG CTGAGGATAT TACGATTAGC 
TTATCAGCAA TGCCTAATAA TATCGTTACC 
AATACCATTC AAGTCAATAA AGGAAAATAC 
GCAGGAATGT TAGAGAAAAT GTATCAACGT 
CACAAAACGG CTGGTTTAAA GAATTTTGGC 
AATTCAACAA GTTATAAATG GACAGAAGAT 



CATGCAAAGA AAAGACATGG AGGAGTGAAT 
ATTGGTGGTA GTGGTTATTT AATAAAAACG 
CAAGAATCGA AAGTGGTCTT GGAAGAAGAT 
AAAGAAATAG TTGCACCAGA TAGTGGCGAA 
ATTGGTTCGG CTTTGATCAT TAAAGATGAT 
GCCAATTTTG AAAAGCAACA AGCCAACACG 
CAAAAATCTT TTACCACAAC CTTGATCTTA 
GATACAAAAC TCGCTACGTT TTATCCGCAA 
GATATGTTGA ATATGACAAG TGGTTTAAAG 
GATGAAGAAA TTATTCAATT TGTTAAACAA 
AATTATTCCC CAGTAAATTT TGTCCTTTTA 
ACCTATCAAG AATTATTTAA TAATCTTTAT 
TTCTATGAAA CCTTATTGGA ACAGCCCAAT 
AATTCATATA ACCAAGTGCT CTCAATTCCT 
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GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG 
TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 
TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 
TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 
AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 
GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TTTAA 

EF131-2 (SEQ ID NO:490) 

MRKRH AKKRHGGVNW LFIVCLLWI GGSGYLIKTF FFTRDSQVSQ ESKWLEEDR 
RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP 
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD MLNMTSGLKL 
SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT YQELFNNLYH 
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG NVDMTTGDLY 
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK 
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF 

EF131-3 (SEQ ID NO:491) 

TTT AATAAAAACG 

TTCTTTTTCA CTAGAGATTC ACAAGTTAGT CAAGAATCGA AAGTGGTCTT GGAAGAAGAT 
CGCCGAAGTG ATAATTATGC GAATTTAACG AAAGAAATAG TTGCACCAGA TAGTGGCGAA 
CTTGATCAAA AAATTCAAGA AACAAATTAT ATTGGTTCGG CTTTGATCAT TAAAGATGAT 
CAGGTTTTAG TAAATAAAGG ATATGGCTTT GC CAATTTTG AAAAGCAACA AGCCAACACG 
CCAAACACAA GGTTTCAGAT TGGCTCAATT CAAAAATCTT TTACCACAAC CTTGATCTTA 
AAAGCAATTG AAGAAGGTAA ACTTACATTA GATACAAAAC TCGCTACGTT TTATCCGCAA 
ATTCAAGGTG CTGAGGATAT TACGATTAGC GATATGTTGA ATATGACAAG TGGTTTAAAG 
TTATCAGCAA TGCCTAATAA TATCGTTACC GATGAAGAAA TTATTCAATT TGTTAAACAA 
AATACCATTC AAGTCAATAA AGGAAAATAC AATTATTCCC CAGTAAATTT TGTCCTTTTA 
GCAGGAATGT TAGAGAAAAT GTATCAACGT ACCTATCAAG AATTATTTAA TAATCTTTAT 
CACAAAACGG CTGGTTTAAA GAATTTTGGC TTCTATGAAA CCTTATTGGA ACAGCCCAAT 
AATTCAACAA GTTATAAATG GACAGAAGAT AATTCATATA ACCAAGTGCT CTCAATTCCT 
GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG 
TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 
TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 
TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 
AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 
GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TT 

EF131-4 (SEQ ID NO:492) 

LIKTF FFTRDSQVSQ ESKWLEEDR 

RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP 
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD MLNMTSGLKL 
SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT YQELFNNLYH 
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG NVDMTTGDLY 
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK 
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF 



EF132-1 (SEQ ID NO:493) 

TAGTTTTCTAATC TC AC C AAAAC AAAAATTTTTAAGAAAGAAGGAGAGATCGTTATGATGAG 
GTGGGAAGTCTGGGAATGTTGATTGCTCTTTTTATATTCGGGG 
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TABLE 1 . Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 

GC TTCGAAC GAAAAATTAAAGGTAGTAGTTAC TAATTCGATTTTAGC AG ATATTACTG AAAATATAGC AAAAGATAAA 
ATTGATTTACACAGTATCGTACCTATTGGGAAAGATCCCCACGAATATGAACCtTTGCCTGAAGATGTTCAAAAAACT 
TCAAAAGCAGATTTGATTTTTTATAACGGTG 

n^TGCGAACAAAGAGGAAAACAAAGACTATTTTGCAGCAAGTGATGGCATAGATGTTATTTA 
GAGAAAGGGAAGGAAGATCCCCATGCTTGGTTAAATTTAGAAAACGG 

TTAGCGGAAAAAGATCCTG ATAATAAAAAATTCTATAAAGAAAATCTAGATAAGTATATTG AAAAGTO TC TA 

GACAAAGAAGCTAAATCTAAATTTGCTTCAATTC 

TATTTCTCGAAAGCGTATAATGTGCCTTCTGCTTACATTTGGGAAAtCAACACTGAAGAAGAAGGAACACCAGATCAA 
ATAAAACACTTAGTTGAAAAATTACGCACAACAAAAGTTCCCTCCTTATTCGTAGAAAGTAGTGTGGACGATAGACCG 
ATGAAAACAGTATCAAAAGATACCAATATTCCTATCTATTCAACGATTTTTACTGATTCAATTGCAGA 
GATGGTGATAGTTACTATGCGATGATGAAATGGAACCTC 

EF132-2 (SEQ ID NO:494) 

MMRKWKVVVGSLGMLIALFIFGACSTNSKDKDTVASNEKLKVVVTNSILADITENIAKDKIDLHSIVPIGKDPHEYEP 
LPEDVQKTSKADL IFYNGVNLXTGGNAWFTKLVKXANKEENKDYFAASDG I DV I YLEGQSEKGKEDPH AWLNLENG I I 
YAKNIEKWLAEKDPDNKKFYKENLDKYIEKLDSLDKEAKSKFASIPNDKKMIVTSEGCFKYFSKAYNVPSAYIWEINT 
EEEGTPDQIKHLVEKLRTTKVPSLFVESSVDDRPMKTVSKDTNIPIYSTIFTDSIAEKGQIX3DSYYAMMKWNLDKIAE 
GLSK. 

EF132-3 (SEQ ID NO:495) 

ATGTTCAACAAATAGTAAAGACAAAGATACAGTGGCTTCGAACGAAAAATTAAAGGTAGTAGTTACTAATTCGAT^ 

AGCAGATATTACTGAAAATATAGCAAAAGATAAAATTGATTTACACAGTATCGTACCTATTGGGAAAGATCCCCACGA 

ATATGAACCtTTGCCTGAAGATGTTCAAAAAACTTCAAAAGCAGATTTGATTTTTTATAACGGTC 

TGGAGGAAATGCTTGGTTTACAAAATTAGTAAAAmATGCGAACAAAGAGGAAAACAAAGACTATTTTGCAGCAAGTGA 

TGGCATAGATGTTATTTACTTAGAGGGTCAGAGTGAGAAAGGGAAGGAAGATCCCCATGCTTGGTTAAATTTAGAAAA 

CGGTATTATTTACGCTAAAAATATTGAAAAATGGTTAGCGGAAAAAGATCCTGATAATAAAAAATTCTATAAAGAAAA 

TCTAGATAAGTATATTG AAAAGTTGGATTC TC TAGAC AAAGAAGC TAAATC TAAATTTGC TTC AATTCCGAATGATAA 

AAAAATGATTGTTACAAGTGAAGGATGCTTtAAATATTTCTCGAAAGCGTATAATGTGCCTTCTGCTTACATTTGGGA 

AAtCAACACTGAAGAAGAAGGAACACCAGATCAAATAAAACACTTAGTTGAAAAATTACGCACAACAAAAGTTCCCTC 

C TTATTCGTAGAAAGTAGTGTGGACGATAGAC CGATGAAAAC AGTATC AA71AGATAC C AATATTC C TATC TATTC AAC 

GATTTTTACTGATTCAATTGCAGAAAAAGGACAAGATGGTGATAGTTACTATGCGATGATGAAATGGAACCTGGATAA 

AATTGCTGAAGGCCTTTCGAAA 



EF132-4 (SEQ ID NO:496) 

C STNSKDKDTVASNEKLK VWTNS I L AD ITEN I AKDK I DLH S I VP I GKDPHE YEPLPEDVQKTSKADL I F YNGVNLXT 
GGNAWFTKLVKXANKEENKDYFAASDGIDVIYLEGQSEKGK^ 

LDKYIEKLDSLDKEAKSKFASIPNDKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKHLVEKLRTTKVPS 
LFVESSVDDRPMKTVSKDTNIPIYSTIFTDSIAEKGQDGDSYYAMMKWNLDKIAEGLSK 
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Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases 
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S.thermophilus exopolysaccharide biosynthesis protein EpsR. 


S.thermophilus exopolysaccharide synthesis operon epsA gene 
product. 


Helicobacter-specific ATPase 439. 


Mycobacterium BCG immunogen. 


Helicobacter-specific ATPase 948 (ORF-4). 


Rat homologue of human Wilson disease gene ATP7B. 


Wilson disease protein ATP7B. 


Product of the sscl gene. 


Flea sodium pump alpha subunit. 


H. pylori transporter protein, 14ce20219orfl. 


Bacteroides fragilis RprX regulatory response protein. 


Tomato TGETRl ethylene response protein. 


Ethylene response (ETR) gene product. 


Ethylene response (ETR) mutant protein etrl-1. 


Ethylene response (ETR) mutant protein etrl-2. 


Ethylene response (ETR) mutant protein etrl-3. 


Ethylene response (ETR) mutant protein etrl-4. 


Regulatory protein VanS involved in glycopeptide resistance. 


Penicillin binding protein PBP2A-epi. 


Penicillin binding protein PBP2A-27R. 


Penicillin binding protein derivative #1. 


Penicillin binding protein derivative #2. 
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PCT/US98/08959 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucine 




Valine 


Polar 


Glutamine 




Asparagine 


Basic 


Arginine 




Lvsine 




Hi uridine 


Acidic 

iivlUlv 


Asnartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 
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Table 4. Residues Comprising Antigenic Epitope-Bearing Portion. 



EF001-2 


from about Asp-150 to about Lys-152, from about Ser-256 to about Tyr- 
259, from about Lys-360 to about Lys-363, from about Asn-406 to about 
Asp-408. 






EF002-2 


from about Asp-80 to about Asp-83, from about Asp-281 to about Gly- 
283. 






EF003-2 


from about Asn-263 to about Gly-266. 






EF004-2 


from about Asn-23 to about Asn-26, from about Lys-83 to about Ser-87, 
from about Tyr-154 to about Asp-159. 






EF005-2 


from about Lys-249 to about Glu-252. 






EF006-2 


from about Gly-23 to about Asp-28. 






EF008-2 


from about Thr-92 to about Gly-94, from about Pro- 161 to about Asp- 
165, from about Gly-287 to about Thr-289. 






EFO 10-2 


from about Pro- 129 to about Asn-131. 






EF012-2 


from about Asp-77 to about Asp-79, from about Asp-94 to about Lys-98, 
from about Asp-256 to about Thr-258, from about Glu-461 to about Asn- 
468. 






EFO 1.3-2 


from about Thr-30 to about Asp-32, from about Glu-73 to about Ala-75, 
from about Gin- 164 to about Asn-166, from about Lys-193 to about Gly- 
195. 






EF014-2 


from about Ser-203 to about Asp-206, from about Gln-314 to about Gly- 
316 






EF015-2 


from about Pro-66 to about Gly-69. 






EF016-2 


from about Lys-236 to about Asn-239. 






EFO 17-2 


from about Ser-90 to about Gly-93, from about Thr-197 to about Lys- 
199, from about Lys-230 to about Asn-233, from about Ser-428 to about 
Gly-431. 






EFO 18-2 


from about Lys-159 to about Tyr-161, from about Asn-165 to about Ser- 
167, from about Asn-250 to about Arg-256, from about Asn-392 to about 
Gly-395, from about Lys-416 to about Tyr-418, from about Asn-428 to 
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about Arg-430. 






EF019-2 


from about Arg-209 to about Ser-21 1, from about Lys-287 to about Ser- 
290. 






EF020-2 


from about Lys-57 to about Asn-62. 






EF021-2 


from about Ser-33 to about Gly-35, from about Glu-77 to about Gly-81, 
from about Asp-139 to about Lys-141, from about Glu-255 to about Ser- 
258, from about Gln-271 to about Tyr-277. 






EF023-2 i 


from about Lys-232 to about Asp-234, from about Arg-304 to about Gly- 
306, from about Thr-453 to about Arg-456, from about Ser-478 to about 
Thr-480. 






EF025-2 


from about Arg-183 to about Asp-185. 






EF026-2 


from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, 
from about Gin- 107 to about Asn-1 10. 






EF027-2 


from about Gln-72 to about Lys-74, from about Lys-229 to about Asp- 
231. 






EF028-2 


from about Asp-186 to about Gin- 188. 






EF029-2 


from about Asp-1 18 to about Lys-122, from about Asp-124 to about 
Tyr-126. 






EF031-2 


from about Glu-30 to about Gly-33. 






EF034-2 


from about Glu-25 to about Gly-27, from about Glu-75 to about Thr-77. 










EF36-2 


from about Gin- 177 to about Ser-179. 






EF037-2 


from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, 
from about Gln-107 to about Asn-1 10. 






EF038-2 


from about Asn-77 to about Lys-79, from about Tyr-88 to about Asn-92. 






EF040-2 


from about Lys-167 to about Gly-172, from about Lys-240 to about 
Asn-242. 







WO 98/50554 PCT/US98/08959 

291 



Table 4. Residues Comprising Antigenic Epitope-Bearing Portion. 



EF044-2 


from about Arg-192 to about Gly-194, from about Asn-200 to about. Asn- 
203. 






EF045-2 


from about Asp-159 to about Asn-161, from about His-172 to about Gly- 
174, from about Tyr-261 to about Gly-264, from about Lys-305 to about 
Glu-308. 






EF046-2 


from about Ser-1 8 to about Gly-23, from about Gln-41 to about Ser-47, 
from about Thr-76 to about Asp-78. 






EF047-2 


from about Asn-28 to about Asp-30, from about Asp-273 to about Asn- 
277. 






EF048-2 


from about Asp-138 to about Lys-141, from about Asp-152 to about 
Gly-154. 






EF051-2 


from about Asp-73 to about Gly-76. 






EF053-2 


from about Ser-79 to about Gly-82. 






EF055-2 


from about Asp-26 to about Gly-28, from about Gln-67 to about Asp-69, 
from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89. 






EF056-2 


from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89. 










EF058-2 


from about Lys-129 to about Gly-133, from about Gln-571 to about Tyr- 
573, from about Pro-586 to about Gly-591 . 






EF065-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF066-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF067-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 
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EF073-2 


from about Met-98 to about Arg-100, from about Arg-1 10 to about Asp- 
112. 






EF074-2 


from about Ser-53 to about Tyr-59, from about Ser-86 to about Gly-88, 
from about Pro-97 to about Gin- 100, from about Gln-230 to about Gly- 
232. 






EF076-2 


from about Asn-38 to about Tyr-40, from about Asp-48 to about Asn-53, 
from about Lys-79 to about Gly-8 1 . 






EF077-2 


from about Arg-41 1 to about Gly-413. 






EF078-2 


from about Thr-294 to about Gly-296, from about Asp-366 to about Gln- 
368, from about Glu-524 to about Gly-526. 






EF080-2 


from about Glu-164 to about Gly-166, from about Ser-206 to about Tyr- 
208, from about Lys-239 to about Gly-243. 






EF081-2 


from about Asn-7 to about Ser-1 1, from about Lys-77 to about Tyr-80, 
from about Lys-1 12 to about Asn-1 14, from about Gly-1 62 to about Asp- 
164, from about Arg-1 81 to about Gly-183. 






EF083-2 


from about Gln-38 to about Arg-40. 






EF084-2 


from about Lys-1 40 to about Asp- 142, from about Gly-164 to about Arg- 
166, from about Arg-262 to about Gly-264. 






EF085-2 


from about Asn-95 to about Asp-97, from about Arg-1 12 to about Asp- 
1 14, from about Asp-258 to about Ser-260, from about Arg-40 1 to about 
Ser-403. 






EF086-2 


from about Pro-1 12 to about Gly-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 






EF087-2 


from about Pro-1 12 to about Gly-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 






EF088-2 


from about Pro-1 12 to about Gly-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 
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EF090-2 


from about Arg-2 to about Arg-5. 






EF091-2 


from about Gln-40 to about Asp-43. 






EF093-2 


from about Lys-95 to about Gly-97. 






EF094-2 


from about Asp-314 to about Asp-316. 






EF095-2 


from about Ser-328 to about Thr-330, from about Asp-359 to about Asp- 
363, from about Glu-637 to about Gly-639, from about Asn-744 to about 
Gly-746. 






EF096-2 


from about Pro-128 to about Asn-130, from about Ser-193 to about Asp- 
196. 






EF097-2 


from about Val-357 to about Gly-359. 






EF099-2 


from about Glu-44 to about Asp-47, from about Lys-154 to about Gly- 
156, from about Asn-286 to about Asp-289. 






EF101-2 


from about Lys-40 to about Asp-42, from about Pro-255 to about Asn- 
258, from about Lys-288 to about Gly-290. 






EF 102-2 


from about Asp-314 to about Asp-316. 






EF103-2 


from about Asn-46 to about Gly-48. 






EF 104-2 


from about Pro-232 to about Lys-237, from about Ala-362 to about Asn- 
366, from about Ser-421 to about Gly-423, from about Lys-488 to about 
Ser-490, from about Asp-550 to about Asn-552, from about Pro-637 to 
about Lys-640, from about Asp-727 to about Gly-729, from about Asn- 
751 to about Ser-754, from about Lys-771 to about Asn-774, from about 
lle-835 to about Asn-837, from about Pro-851 to about Gly-853. 






EF105-2 


from about Ser-40 to about Gly-43, from about Asn-94 to about Gln-97, 
from about Gln-220 to about Gly-222, from about Asn-263 to about Gly- 
265. 






EF 106-2 


from about Asp-72 to about Gly-75, from about Thr-274 to about Asp- 
277, from about Asn-310 to about Arg-313. 






EF 107-2 


from about Thr-155 to about Asn-157, from about Thr-189 to about Asp- 
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191, from about Arg-270 to about Gly-272, from about Thr-330 to about 
Lys-335, from about Asp-365 to about Asp-368, from about Pro-451 to 
about Asp-453, from about Gly-485 to about Thr-488. 






EF108-2 


from about Lys-142 to about Trp-145, from about Thr-147 to about Tyr- 
1 50, from about Arg-212 to about Gly-214, from about Ser-248 to about 
Asp-251, from about Asp-384 to about Asp-387, from about Pro-481 to 
about Arg-483, from about Lys-491 to about Gly-494, from about Thr- 
619 to about Gly-624, from about Asp-656 to about Asp-659, from about 
Lys-717 to about Asn-721, from about Ser-822 to about Gly-824, from 
about Tyr-1 137 to about Thr-1 141 . 






EF110-2 


from about Pro- 123 to about Gly-127, from about Thr-223 to about Gly- 
225. 






EF111-2 


from about Lys-207 to about Asn-209, from about Asp-245 to about 
Asn-248, from about Lys-396 to about Asp-398, from about Glu-429 to 
about Ser-432, from about Thr-470 to about His-474. 






EF119-2 


from about Asp-90 to about Asn-92, from about Gin- 142 to about Gly- 
144. 






EF121-2 


from about Asn-159 to about Asp-161, from about Asn-351 to about 
Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to 
about Ser-789. 






EF122-2 


from about Asn- 1 59 to about Asp- 161, from about Asn-35 1 to about 1 
Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to 
about Ser-789. : 






EF123-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Ser-782, from about Tyr-909 to about 
Asn-91 1, from about Lys-939 to about Glu-942, from about Asp-1074 to 
about Gly-1076, from about Asp- 1367 to about Gly-1369, from about 
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
from about Lys-1656 to about Asp- 1660, from about Lys-1860 to about 
Gln-1863, from about Ser-1916 to about Gln-1919, from about Pro-1940 
to about Gly-1942. 






EF124-2 


from about Asn-33 1 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Ser-782, from about Tyr-909 to about 
Asn-91 1, from about Lys-939 to about Glu-942, from about Asp-1074 to 
about Gly-1076, from about Asp-1367 to about Gly-1369, from about 
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
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from about Lys-1656 to about Asp-1660, from about Lys-1860 to about 
Gln-1863, from about Ser-1916 to about Gln-1919, from about Pro-1940 
to about Gly-1942. 






EF125-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Scr-782, from about Tyr-909 to about 
Asn-91 1, from about Lys-939 to about Glu-942, from about Asp-1074 to 
about Gly-1076, from about Asp-1367 to about Gly-1369, from about 
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
from about Lys-1656 to about Asp-1660, from about Lys-1860 to about 
Gln-1863, from about Ser-1916 to about Gln-1919, from about Pro-1940 
to about Gly-1942. 






EF126-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491 , from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF127-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF128-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF129-2 


from about Asn-300 to about Gly-302, from about Ser-316 to about Gly- 
319, from about Asn-385 to about His-387 






EF131-2 


from about Lys-201 to about Tyr-204, from about Glu-263 to about Ser- 
266. 






EF132-2 


from about Thr-26 to about Ser-28. 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \lbis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 10 .line 12 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet r-j 



Narr.' of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manasas, Virginia 201 10-2209 
United Slates of America 



Date of deposit May 2, 1997 



Accession Number 55969 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submined to the international Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 



For receiving Office use only . 



w 



Authorized 



This sheet was received with the international application 
d officer \S 



• For International Bureau use onlv > 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding any one of the amino acid sequences of the 
polypeptides shown in Table 1; or 

(b) a nucleotide sequence complementary to any one of the nucleotide sequences in (a). 

(c) a nucleotide sequence at least 95% identical to any one of the nucleotide sequences 
shown in Table 1 ; or, 

(d) a nucleotide sequence at least 95% identical to a nucleotide sequence complementary 
to any one of the nucleotide sequences shown in Table 1. 

2. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which 
hybridizes under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence identical to a nucleotide sequence in (a) or (b) of claim 1 . 

3. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which 
encodes an epitope-bearing portion of a polypeptide in (a) of claim 1 . 

4. The isolated nucleic acid molecule of claim 3, wherein said epitope-bearing portion 
of a polypeptide comprises an amino acid sequence listed in Table 4. 

5. A method for making a recombinant vector comprising inserting an isolated nucleic 
acid molecule of claim 1 into a vector. 

6. A recombinant vector produced by the method of claim 5. 

7. A host cell comprising the vector of claim 6. 

8. A method of producing a polypeptide comprising: 

(a) growing the host cell of claim 7 such that the protein is expressed by the cell; and 

(b) recovering the expressed polypeptide. 

9. An isolated polypeptide comprising a polypeptide selected from the group 
consisting of: 

(a) a polypeptide consisting of one of the complete amino acid sequences of Table 1 ; 

(b) a polypeptide consisting of one the complete amino acid sequences of Table 1 except 
the N-terminal residue; 
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(c) a fragment of the polypeptide of (a) having biological activity; and 

(d) a fragment of the polypeptide of (a) which binds to an antibody specific for the 
polypeptide of (a). 

10. An isolated antibody specific for the polypeptide of claim 9. 

1 1. A polypeptide produced according to the method of claim 8. 

12. An isolated polypeptide comprising an amino acid sequence at least 95% identical to 
a sequence selected from the group consisting of an amino acid sequence of any one of 
the polypeptides in Table 1. 

13. An isolated polypeptide antigen comprising an amino acid sequence of an £. 
faecalis epitope shown in Table 4. 

14. An isolated nucleic acid molecule comprising a polynucleotide with a nucleotide 
sequence encoding a polypeptide of claim 9. 

15. A hybridoma which produces an antibody of claim 10. 

16. A vaccine* comprising: 

(1) one or more E. faecalis polypeptides selected from the group consisting of a 
polypeptide of claim 9; and 

(2) a pharmaceutical^ acceptable diluent, carrier, or excipient; 

wherein said polypeptide is present, in an amount effective to elicit protective 
antibodies in an animal to a member of the Enterococcus genus. 

17. A method of preventing or attenuating an infection caused by a member of the 
Enterococcus genus in an animal, comprising administering to said animal a polypeptide 
of claim 9, wherein said polypeptide is administered in an amount effective to prevent 
or attenuate said infection. 

18. A method of detecting Enterococcus nucleic acids in a biological sample 
comprising: 

(a) contacting the sample with one or more nucleic acids of claim 1, under conditions 
such that hybridization occurs, and 

(b) detecting hybridization of said nucleic acids to the one or more Enterococcus 
nucleic acid sequences present in the biological sample. 
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19. A method of detecting Enterococcus nucleic acids in a biological sample obtained 
from an animal, comprising: 

(a) amplifying one or more Enterococcus nucleic acid sequences in said sample using 
polymerase chain reaction, and 

(b) detecting said amplified Enterococcus nucleic acid. 

20. A kit for detecting Enterococcus antibodies in a biological sample obtained from an 
animal, comprising 

(a) a polypeptide of claim 9 attached to a solid support; and 

(b) detecting means. 

21. A method of detecting Enterococcus antibodies in a biological sample obtained 
from an animal, comprising 

(a) contacting the sample with a polypeptide of claim 9; and 

(b) detecting antibody-antigen complexes. 
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effects of the compound/composition. 
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an extent that no meaningful International Search can be carried out, specifically: 

Further defects(s) under article 17(2)(a): 

The gene EF078 which is mentioned in Table 4, is not cited in Table 1 
and is also absent from the sequence listing. 

I Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



□ As ail required additional search fees were timely paid by the applicant, this International Search Report covers all 
searchable claims. 



2. I As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
I 1 covers only those claims for which fees were paid, specifically claims Nos.: 



4. X No required additional search fees were timely paid by the applicant. Consequently, this International Search Report i 
". 1 1 restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



See extra sheet, Invention 1. 

Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

f^J No protest accompanied the payment of additional search fees. 
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inventions 7 to 41: Claims: (1-21) partially 

Idem as invention 1, but concerning EF008 to EF0042 

Inventions 42 to 74: Claims: (1-21) partially 

Idem as invention 1, but concerning EF045 to EF077 

Inventions 75 to 107: Claims: (1-21) partially 

Idem as invention 1, but concerning EF079 to EF111 

Inventions 108 to 123: Claims: (1-21) partially 

Idem as invention 1, but concerning EF117 to EF132 

Invention 124: Claim: 13 partially 

An isolated polypeptide antigen comprising an amino acid 
sequence of an Enterococcus faecal is epitope of EF078 shewn in 
Table 4. 

For the sake of conciseness, the first subject matter is explicitly 
:a" : ned, the other subject matters are defined by analogy thereto. 
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